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(without alignments) 
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Searched: 1586107 seqs, 282547505 residues 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
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ALIGNMENTS 



RESULT 1 
AAB72500 

ID AAB72500 standard; peptide; 7 AA. 
XX 

AC AAB72500; 
XX 

DT 09-MAY-2001 (first entry) 
XX 

DE Colostrinin peptide #1. 
XX 

KW Dermatological; oxidative stress regulator; colostrinin. 
XX 

OS Unidentified. 
XX 

PN WO200112650-A2. 
XX 



PD 22-FEB-2001. 
XX 

PF 17-AUG-2000; 2 000WO-US022665 . 
XX 

PR 17-AUG-1999; 99US-014 931 OP . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Stanton GJ, Hughes TK, Boldogh I; 
XX 

DR WPI; 2001-218342/22. 
XX 

PT Modulating oxidative stress level in a cell, involves contacting the cell 

PT with an oxidative stress regulator selected from colostrinin, its 

PT constituent peptide, analog or their combinations. 
XX 

PS Claim 6; Page 25; 48pp; English. 
XX 

CC The present invention relates to a method for modulating the oxidative 

CC stress level in a cell or a patient, comprising contacting the cell with, 

CC or administering to the patient, an oxidative stress regulator selected 

CC from colostrinin, or its constituent peptide (e.g. the present peptide), 

CC to change the level of an oxidising species in the cell. The method can 

CC be used to treat oxidative damage to skin, by decreasing or preventing an 

CC increase in the level of damage to a biomolecule of the patient 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 42; DB 4; Length 7; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I M 

Db 1 MQPPPLP 7 



RESULT 2 




AAB59324 




ID 


AAB59324 standard; peptide; 7 AA. 




XX 






AC 


AAB59324; 




XX 






DT 


21-MAR-2001 (first entry) 




XX 






DE 


Ewe colostrinin peptide fragment 


B-9. 


XX 






KW 


Sheep; colostrinin; proline rich 


polypeptide; colostrum; immune disorder; 


KW 


central nervous system disorder; 


dietary supplement; beta-amyloid plaque. 


XX 






OS 


Ovis sp. 




XX 






PN 


WO200075173-A2. 




XX 






PD 


14-DEC-2000. 




XX 






PF 


02-JUN-2000; 2000WO-GB002128 . 





XX 

PR 02-JUN-1999; 99GB-00012852 . 
XX 

PA (REGE-) REGEN THERAPEUTICS PLC. 
XX 

PI Georgiades JA; 
XX 

DR WPI; 2001-071058/08. 
XX 

PT Peptides having an N~terminal amino acid sequence isolated from 

PT colostrinin for treating e.g. disorders of the central nervous system and 

PT immune system, viral and bacterial infections, and diseases characterized 

PT by amyloid plaques . 

XX 

PS Claim 7; Page 27; 63pp; English. 
XX 

CC The present invention provides the sequences of a number of peptides 

CC found in ewe's colostrinin, Colostrinin is the proline-rich polypeptide 

CC fragment of colostrum. These peptides can be used in the treatment of 

CC central nervous system disorders such as senile dementia, Parkinson's 

CC disease, Alzheimer's disease, psychosis and neurosis, immune system 

CC disorders such as bacterial and viral infections, to improve the 

CC development of a child's immune system, as a dietary supplement, and to 

CC promote the dissolution of beta-amyloid plaques 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 42; DB 4; Length 7; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I I I 

Db 1 MQPPPLP 7 



RESULT 3 




AAB72246 




ID 


AAB72246 standard; peptide; 7 AA. 




XX 






AC 


AAB72246; 




XX 






DT 


14-MAY-2001 (first entry) 




XX 






DE 


Colostrinin derived cytokine inducing peptide 


SEQ ID 1. 


XX 






KW 


Colostrinin; immune response; cytokine; blood 


cell proliferation; 


KW 


central nervous system disorder; neurological 


diosrder; mental disorder; 


KW 


dementia; neurodegenerative disease; Alzheimer 


■*s disease; psychosis; 


KW 


neurosis; infection. 




XX 






OS 


Synthetic. 




XX 






PN 


WO200111937-A2. 




XX 






PD 


22-FEB-2001. 




XX 







PF 17-AUG-2000; 2 0 0 0WO-US022 8 1 8 . 
XX 

PR 17-AUG-1999; 99US-014 9311P . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 

PA (REGE-) REGEN THERAPEUTICS PLC. 

XX 

PI Stanton GJ, Hughes TK, Boldogh I, Georgiades J; 
XX 

DR WPI; 2001-202804/20. 
XX 

PT Inducing a cytokine and modulating an immune response, useful for 

PT treating central nervous system diseases and bacterial and viral 

PT infections, comprises administering colostrinin as an immunological 

PT regulator. 
XX 

PS Claim 1; Page 34; 50pp; English. 
XX 

CC Sequences AAB72246 - AAB72275 represent peptides derived from clostrinin, 

CC a proline rich polypeptide aggregate contained in colostrum. The peptides 

CC have immune response modulatory activity, and are capable of inducing 

CC cytokines . Colostrinin and its derived peptides are useful for inducing 

CC cytokine production, for modulating an immunological response and for 

CC inducing blood cell proliferation. The peptides are useful in the 

CC treatment of disorders of the central nervous system, neurological 

CC disorders, mental disorders, dementia, neurodegenerative diseases, 

CC Alzheimer's disease, motor neurone disease, psychosis, neurosis, chronic 

CC disorders of the immune system, bacterial and viral infections and 

CC acquired immunological deficiencies 

XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 42; DB 4; Length 7; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I i I 

Db 1 MQPPPLP 7 



RESULT 4 
7^AB72532 

ID AAB72532 standard; peptide; 7 AA. 
XX 

AC AAB72532; 
XX 

DT 09-MAY-2001 (first entry) 
XX 

DE Colostrinin peptide #1. 
XX 

KW Neuroprotective; neural cell differentiation regulator; colostrinin; 

KW colostrum. 

XX 

OS Unidentified. 
XX 

PN WO200112651-A2 . 



XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 

cc 
cc 
cc 
cc 

XX 
SQ 



22-FEB-2001. 

17-AUG-2000; 2 000WO-US022774 . 
17-AUG-1999; 99US-014 9633P . 
(TEXA ) UNIV TEXAS SYSTEM. 
Boldogh I ; 

WPI; 2001-226545/23. 

Use of colostrinin, its constituent peptide or analog as a neural cell 
regulator, for promoting neural cell differentiation and treating damaged 
neural cells in a patient. 

Claim 6; Page 21; 35pp; English. 

The present invention relates to a method for promoting neural cell 
differentiation and treating damaged neural cells, using colostrinin and 
colostrinin constituent peptides (e.g. the present peptide) as a neural 
cell regulator. Colostrinin is a polypeptide complex found in colostrum 

Sequence 7 AA; 



Query Match 100.0%; Score 42; DB 4; Length 7; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 
Matches 7; Conservative 0; Mismatches 0; Indels 

Qy 1 MQPPPLP 7 

I I I I I I I 
Db 1 MQPPPLP 7 



0; Gaps 



0; 



RESULT 5 
AA014577 

ID AA014577 standard; peptide; 7 AA. 
XX 

AC AA014577; 
XX 

DT 27-MAY-2002 (first entry) 
XX 

DE Neural cell regulatory colostrinin peptide 1. 
XX 

KW Neural, cell differentiation; neural cell regulator; colostrinin peptide; 
KW neural cell formation; proline-rich polypeptide aggregate; colostrum; 
KW neural cell treatment. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 
FT Modified-site 7 

FT /note= "Optional C-terminal amide" 

XX 

PN WO200213851-A1. 
XX 



PD 21-FEB-2002. 
XX 

PF 17-AUG-2000; 2 000WO-US022777 . 
XX 

PR 17-AUG-2000; 2 000WO-US022 77 7 . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Boldogh I, Stanton JG, Hughes TK; 
XX 

DR WPI; 2002-269152/31, 
XX 

PT Promoting cell differentiation in a patient involves use of blood cell 

PT regulator selected from colostrinin^ its constituent peptide and/or 

PT analog. 
XX 

PS Claim 7; Page 21; 37pp; English. 
XX 

CC The invention comprises a method for promoting cell differentiation (e.g 

CC neural cell differentiation) . The method involves contacting cells with 

CC neural cell regulator (i.e. a colostrinin peptide) in order to change th 

CC cells in morphology to form neural cells. Colostrinin is a proline-rich 

CC polypeptide aggregate that is present in colostrum. The method of the 

CC invention is useful for promoting the differentiation of cells and for 

CC treating damaged neural cells in a patient. The present amino acid 

CC sequence represents a specifically claimed colostrinin peptide used in 

CC the method of the invention 
XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 42; DB 5; Length 7; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 MQPPPLP 7 

I I I I I I I 

Db 1 MQPPPLP 7 



RESULT 6 


AAM51036 


ID 


AAM51036 standard; peptide; 7 AA. 


XX 




AC 


AAM51036; 


XX 




DT 


30-MAY-2002 (first entry) 


XX 




DE 


Colostrinin constituent peptide. 


XX 




KW 


Colostrinin; colostrum; immunomodulator ; cardiovascular; 


KW 


blood cell regulator; cytokine inducer; beta-casein; human 


XX 




OS 


Homo sapiens . 


XX 




FH 


Key Location/Qualifiers 


FT 


Modified- site 7 


FT 


/note= "optional C-terminal amidation" 



XX 

PN WO200213849-A1. 
XX 

PD 21-FEB-2002 . 
XX 

PF 17-AUG-2000; 2 0 00WO-US0227 7 5 . 
XX 

PR . 17-AUG-2000; 2 0 00WO-US02277 5 . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 

PA (REGE-) REGEN THERAPEUTICS PLC. 

XX 

PI Stanton GJ, Hughes TK, Boldogh I, Georgiades J; 
XX 

DR WPI; 2002-269150/31. 
XX 

PT Modulation of blood cell proliferation in a patient involves use of blood 

PT cell regulator selected from colostrinin, its constituent peptide and/or 

PT analog. 
XX 

PS Claim 1; Page 34; 54pp; English. 
XX 

CC The present sequence is that of a colostrinin constituent peptide that is 

CC preferred for use as an immunological regulator and as a blood cell 

CC regulator in claimed methods of the invention. It is classified as having 

CC a beta-casein homologue precursor. Methods are claimed for: inducing a 

CC cytokine in a cell by contact with an immunological regulator, where the 

CC cell is present in a cell culture, a tissue, an organ or an organism, and 

CC the cell is mammalian, including human; modulating an immune response in 

CC a cell by contact with the immunological regulator under conditions 

CC effective to induce a cytokine; modulating an immune response in a 

CC patient by administering an immunological regulator under conditions 

CC effective to induce a cytokine, where the immunological regulator is 

CC administered topically or as part of a dietary supplement, and where the 

CC immune response is specific or non specific, an interferon response or an 

CC antibody response; modulating blood cell proliferation by contacting 

CC blood cells with a blood cell regulator, where the blood cells are 

CC present in a cell culture or an organism, are mammalian or human, and 

CC where the blood cells are increased in number or differentiated; and a 

CC method for modulating blood cell proliferation in a patent. A claimed 

CC cytokine-inducing composition comprises a pharmaceutical carrier and an 

CC active agent such as the present peptide. Cytokines induced by this 

CC peptide in human leucocyte cultures include interf eron-gamma, tumour 

CC necrosis factor-alpha, interleukin-4 , interleukin-6, interleukin-10 and 

CC interleukin-12 . It was one of the best overall inducers in almost all 

CC cytokine and blood cell proliferation experiments conducted 

XX 

SQ Sequence 7 AA; 

Query Match 100.0%; Score 42; DB 5; Length 7; 
Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I M I I I I 

Db 1 MQPPPLP 7 



RESULT 7 
AAE20228 

ID AAE20228 standard; peptide; 7 AA. 
XX 

AC AAE2 022 8; 
XX 

DT 18-JUN-2002 (first entry) 
XX 

DE Colostrinin constituent peptide #1. 
XX 

KW Blood cell regulator; colostrinin; constituent peptide; oxidative stress; 

KW therapy; oxidative damage; skin; aging; wound healing; cell replacement; 

KW tissue; organ; cosmetic procedure; repair; regeneration; preservation; 

KW transplantation; implantation; dermatological ; vulnerary. 
XX 

OS Unidentified. 
XX 

FH Key Location/Qualifiers 

FT Modified-site 7 

FT /note= "Optionally C-terminal amide" 
XX 

PN WO200213850-A1. 
XX 

PD 21-FEB-2002. 
XX 

PF 17-AUG-2000; 2 00 0WO-US0227 7 6 . 
XX 

PR 17-AUG-2000; 2 000WO-US0227 7 6 . 
XX 

PA (TEXA ) UNIV TEXAS SYSTEM. 
XX 

PI Stanton GJ, Hughes TK, Boldogh I; 
XX 

DR WPI; 2002-269151/31. 
XX 

PT Composition useful for the modulation of blood cell proliferation in a 

PT patient comprises a blood cell regulator selected from colostrinin;. its 

PT constituent peptide and/or analog. 
XX 

PS Claim 6; Page 25; 51pp; English. 
XX 

CC The invention relates to a composition which comprises a blood cell 

CO regulator selected from colostrinin^ its constituent peptide and/or 

CC analogue. The invention is used for modulating the oxidative stress level 

CC in a cell e.g. mammalian or human cell present in a cell culture, tissue, 

CC organ, or organism; or for treating oxidative damage to the skin of a 

CC patient e.g. animal or human; to modulate oxidative stress during/ after 

CC a premature birth or normal birth, preventing/delaying aging in a 

CC patient, enhancing wound healing, and the reduction of side effects of 

CC cosmetic procedures. The method changes the level of an oxidising species 

CC in the cell, such as decreases or prevents increase in the level of 

CC damage to a biomolecule of the patient selected from DNA, protein and/or 

CC lipid, compared to the same conditions when the oxidative stress 

CC regulator is not present. The modulation of oxidative stress results in 

CC enhanced repair, regeneration, and replacement of cells, tissues and 

CC organs (e.g. kidney, liver, pancreas, skin, and the other internal and 



CC external organs), as well as enhanced preservation of such organs for 

CC transplantation^, implantation, or scientific research. The present 

CC sequence is a colostrinin constituent peptide 
XX 

SQ Sequence 7 A?V; 

Query Match 100.0%; Score 42; DB 5; Length 7; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I I I 
Db 1 MQPPPLP 7 



RESULT 8 


AAB59354 


ID 


AAB59354 standard; peptide; 8 AA, 


XX 




AC 


AAB59354; 


XX 




DT 


21-MAR-2001 (first entry) 


XX 




DE 


Ewe colostrinin peptide fragment derived sequence #14, 


XX 






Sheep; colostrinin; proline rich polypeptide; colostrum; immune disorder; 


KW 


central nervous system disorder; dietary supplement; beta-amyloid plaque. 


XX 




OS 


Ovis sp. 


XX 




PN 


WO200075173 -A2 . 


XX 




PD 


14-DEC-2U00 . 


XX 




"D IP 


Uz — J UrJ — ^ U U U ; Z U U UWO— boU Uz 1^ o . 


XX 




PR 


02-JUN-1999; 99GB-00012 852 . 


XX 




PA 


(REGE-) REGEN THERAPEUTICS PLC. 


XX 




PI 


Georgiades JA; 


XX 




DR 


WPI; 2001-071058/08. 


XX 




PT 


Peptides having an N-terminal amino acid sequence isolated from 


PT 


colostrinin for treating e.g. disorders of the central nervous system and 


PT 


immune system, viral and bacterial infections, and diseases characterized 


PT 


by amyloid plaques . 


XX 




PS 


Claim 8; Page 27; 63pp; English. 


XX 




CC 


The present invention provides the sequences of a number of peptides 


CC 


found in ewe's colostrinin. Colostrinin is the proline-rich polypeptide 


CC 


fragment of colostrum. These peptides can be used in the treatment of 


CC 


central nervous system disorders such as senile dementia, Parkinson's 


CC 


disease, Alzheimer's disease, psychosis and neurosis, immune system 


CC 


disorders such as bacterial and viral infections, to improve the 



CC development of a child's immune system, as a dietary supplement, and to 

CC promote the dissolution of beta-amyloid plaques 

XX 

SQ Sequence 8 AA; 

Query Match 100.0%; Score 42; DB 4; Length 8; 

Best Local Similarity 100.0%; Pred. No. 1.4e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

M I I I I I 
Db 2 MQPPPLP 8 



RESULT 9 
ABB67510 

ID ABB67510 standard; protein; 316 AA. 
XX 

AC ABB67510; 
XX 

DT 26-MAR-2002 (first entry) 
XX 

DE Drosophila melanogaster polypeptide SEQ ID NO 29322. 
XX 

KW Drosophila; developmental biology; cell signalling; insecticide; 

KW pharmaceutical. 

XX 

OS Drosophila melanogaster. 
XX 

PN WO200171042-A2 . 
XX 

PD 27-SEP-2001. 
XX 

PF 23-MAR-2001; 2 00 IWO-US 0092 31 . 
XX 

PR 23-MAR-2000; 2000US-0191637P . 

PR ll-JUL-2000; 2 00 OUS- 00 614 15 0 . 
XX 

PA (PEKE ) PE CORP NY. 
XX 

PI Venter JC, Adams M, Li PWD, Myers EW; 
XX 

DR WPI; 2001-656860/75. 

DR N-PSDB; ABL11613. 
XX 

PT New isolated nucleic acid detection reagent for detecting 1000 or more 

PT genes from Drosophila and for elucidating cell signaling and cell-cell 

PT interactions. 
XX 

PS Disclosure; SEQ ID NO 29322; 21pp + Sequence Listing; English. 
XX 

CC The invention relates to an isolated nucleic acid detection reagent 

CC capable of detecting 1000 or more genes from Drosophila. The invention is 

CC useful in developmental biology and in elucidating cell signalling and 

CC cell-cell interactions in higher eukaryotes for the development of 

CC insecticides, therapeutics and pharmaceutical drugs. The invention 

CC discloses genomic DNA sequences (ABL16176-ABL30511) , expressed DNA 



CC sequences (ABL01840-ABL16175) and the encoded proteins (ABB57737- 

CC ABB72072) . The sequence data for this patent did not form part of the 

CC printed specification, but was obtained in electronic format directly 

CC from WIPO at ftp.wipo.int/pub/published_pct_sequences 

XX 

SQ Sequence 316 AA; 

Query Match 100.0%; Score 42; DB 4; Length 316; 

Best Local Similarity 100.0%; Pred. No. 2.6e+02; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 1 MQPPPLP 7 

I I I I I I I 

Db 249 MQPPPLP 255 



RESULT 10 
AAU45697 

ID AAU45697 standard; protein; 95 AA. 
XX 

AC AAU45697; 
XX 

DT 27-FEB-2002 (first entry) 
XX 

DE Propionibacterium acnes immunogenic protein #6593. 
XX 

KW SAPHO syndrome; synovitis; acne; pustulosis; hypertosis; osteomyelitis 

KW uveitis; endophthalmitis; bone; joint; central nervous system; ELISA; 

KW inflammatory lesion; acne vulgaris; enzyme linked immunosorbent assay; 

KW dermatological ; osteopathic; neuroprotectant . 
XX 

OS Propionibacterium acnes . 
XX 

PN WO200181581-7V2. 
XX 

PD Ol-NOV-2 001. 
XX 

PF 20-APR-2001; 2001WO-US012865 . 
XX 

PR 21-APR-2000; 2000US-0199047P . 

PR 02-JUN-2000; 2000US-0208841P . 

PR 07-JUL-2000; 2000US-0216747P . 
XX 

PA (CORI-) CORIXA CORP. 
XX 

PI Skeiky YAW, Persing DH, Mitcham JL, Wang SS, Bhatia A; 

PI L 'maisonneuve J, Zhang Jen S, Carter D; 

XX 

DR WPI; 2001-616774/71. 

DR N-PSDB; AAS59526. 
XX 

PT Propionibacterium acnes polypeptides and nucleic acids useful for 

PT vaccinating against and diagnosing infections, especially useful for 

PT treating acne vulgaris. 
XX 

PS Claim 6; SEQ ID NO 6892; 1069pp; English. 
XX 



CC Sequences AAU39105-AAU68 017 represent Propionibacterium acnes immunogenic 

CC polypeptides. The proteins and their associated DNA sequences are used in 

CC the treatment, prevention and diagnosis of medical conditions caused by 

CC P. acnes. The disorders include SAPHO syndrome (synovitis, acne, 

CC pustulosis, hypertosis and osteomyelitis), uveitis and endophthalmitis. 

CC P. acnes is also involved in infections of bone, joints and the central 

CC nervous system, however it is particularly involved in the inflammatory 

CC lesions associated with acne vulgaris. A method for detecting the 

CC presence or absence of P. acnes in a patient comprises contacting a 

CC sample with a binding agent that binds to the proteins of the invention 

CC and determining the amount of bound protein in the sample. The 

CC polypeptides may be used as antigens in the production of antibodies 

CC specific for P. acnes proteins. These antibodies can be used to 

CC downregulate expression and activity of P. acnes polypeptides and 

CC therefore treat P. acnes infections. The antibodies may also be used as 

CC diagnostic agents for determining P. acnes presence, for example, by 

CC enzyme linked immunosorbent assay (ELISA) . Note: The sequence data for 

CC this patent did not form part of the printed specification, but was 

CC obtained in electronic format directly from WIPO at 

CC ftp . wipo . int/pub/published_pct_sequences 

XX 

SQ Sequence 95 AA; 

Query Match 92.9%; Score 39; DB 4; Length 95; 

Best Local Similarity 85.7%; Pred. No. 2.3e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I : I I I I I 
Db 8 8 MEPPPLP 94 



RESULT 11 


ABM42216 


ID 


ABM42216 standard; protein; 95 AA. 


XX 




AC 


ABM42216; 


XX 




DT 


20-OCT-2003 (first entry) 


XX 




DE 


Propionibacterium acnes immunogenic polypeptide #6892. 


XX 




KW 


Acne vulgaris; antiseborrhoeic; dermatological ; antibacterial; 


KW 


immunostimulant ; immune response; vaccine; immunogenic. 


XX 




OS 


Propionibacterium acnes . 


XX 




PN 


WO2003033515-A1. 


XX 




PD 


24-APR-2003. 


XX 




PF 


11-OCT~2002; 2002WO-US032727 . 


XX 




PR 


15-OCT-2001; 2001US-00978825 . 


XX 




PA 


(CORI-) CORIXA CORP. 


XX 





PI Mitcham JL, Skeiky YAW, Parsing DH, Bhatia A, Maisonneuve JL; 

PI Zhang Y, Wang S, Jen S, Lodes MJ, Benson DR, Jones R, Carter D; 

PI Barth B, Vallieve-Douglass J; 

XX 

DR WPI; 2003-381789/36. 

DR N-PSDB; ACF64455. 
XX 

PT New Propionibacterium acnes polypeptides and polynucleotides encoding the 

PT polypeptide, useful for diagnosing, preventing or treating acne vulgaris, 

PT or for stimulating an immune response specific for a P. acnes protein. 
XX 

PS Claim 6; SEQ ID NO 6892; 1481pp; English. 
XX 

CC The invention relates to an isolated polynucleotide (ACF64435-ACF64733) 

CC encoding a Propionibacterium acnes protein. The invention also relates to 

CC polypeptides encoded by the polynucleotides (ABM3562 4-ABM64 53 6) and to 

CC immunogenic fragments of P. acnes polypeptides. The invention 

CC additionally encompasses expression vectors and host cells comprising a 

CC polynucleotide of the invention; antibodies against polypeptides of the 

CC invention; fusion proteins comprising a polypeptide of the invention; a 

CC method for stimulating an immune response specific for a P. acnes 

CC polypeptide and an isolated T cell population comprising T cells prepared 

CC via this method; a vaccine composition (comprising P. acnes polypeptides, 

CC polynucleotides, antibodies, fusion proteins, T cell populations, or 

CC antigen-presenting cells that express the polypeptide) ; a method and kit 

CC for detecting or determining the presence or absence of P. acnes in a 

CC patient; and a method for inhibiting the development of P. acnes in a 

CC patient. The P. acnes polypeptides, polynucleotides, antibodies, fusion 

CC proteins, T cell populations or antigen-presenting cells that express the 

CC polypeptides are useful for diagnosing, preventing or treating acne 

CC vulgaris, or for stimulating an immune response specific for a P. acnes 

CC protein. The polynucleotides can also be used as probes or primers for 

CC nucleic acid hybridisation. The vaccine composition is useful for the 

CC stimulation of an immune response against P. acnes, or for treating acne, 

CC and the kit is useful for performing a diagnostic assay. The present 

CC sequence represents a specifically claimed P. acnes polypeptide which is 

CC thought to contain an immunogenic region. Note: The sequence data for 

CC this patent did not form part of the printed specification, but was 

CC obtained in electronic format directly from WIPO at 

CC ftp . wipo . int/ pub/ published_pct_sequences 

XX 

SQ Sequence 95 AA; 

Query Match 92.9%; Score 39; DB 6; Length 95; 

Best Local Similarity 85.7%; Pred. No. 2.3e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I : I I I I I 
Db 88 MEPPPLP 94 



RESULT 12 
AAM17529 

ID AAM17529 standard; protein; 132 AA. 
XX 

AC AAM17 52 9; 



XX 

DT 12-OCT-2001 (first entry) 
XX 

DE Peptide #3963 encoded by probe for measuring cervical gene expression. 
XX 

KW Probe; human; microarray; gene expression; cervical epithelial cell; 

KW cervical cancer . 
XX 

OS Homo sapiens. 
XX 

PN WO200157278-A2. 
XX 

PD 09-AUG-2001 . 
XX 

PF 30-JAN-2001; 2 001WO-US000670 . 
XX 

PR 04-FEB-2000; 

PR 2 6-MAY-2000; 

PR 30-JUN-2000; 

PR 03-AUG-2000; 

PR 21-SEP-2000; 

PR 27-SEP-2000; 

PR 04-OCT-2000; 
XX 

PA (MOLE-) MOLECULAR DYNAMICS INC. 
XX 

PI Penn SG, Hanzel DK, Chen W, Rank DR; 
XX 

DR WPI; 2001-488901/53. 
XX 

PT Human genome-derived single exon nucleic acid probes useful for analyzing 

PT gene expression in human cervical epithelial cells. 

XX 

PS Claim 27; SEQ ID NO 22355; 487pp; English. 
XX 

CC The present invention relates to human single exon nucleic acid probes 

CC (SENP: see AAI10068-AAI28459 ) . The present sequence is a peptide encoded 

CC by one such probe. The SENPs are derived from human HeLa cells. The SENPs 

CC can be used to produce a single exon microarray, which can be used for 

CC measuring human gene expression in a sample derived from human cervical 

CC epithelial cells. By measuring gene expression, the probes are therefore 

CC useful in grading and/or staging of diseases of the cervix, notably 

CC cervical cancer. Note: The sequence data for this patent did not form 

CC part of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 132 AA; 

Query Match 92.9%; Score 39; DB 4; Length 132; 
Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 

Db 116 LQPPPLP 122 



2000US-0180312P. 
2000US-0207456P. 
2000US-00608408. 
2000US-00632366. 
2000US-0234687P. 
2000US-0236359P. 
2000GB-00024263. 



RESULT 13 
ABB36552 

ID ABB36552 standard; peptide; 132 AA, 
XX 

AC ABB36552; 
XX 

DT 04-FEB-2002 (first entry) 
XX 

DE Peptide #4058 encoded by human foetal liver single exon probe. 
XX 

KW Human; foetal liver; gene expression; single exon nucleic acid probe. 
XX 

OS Homo sapiens. 
XX 

PN WO200157277-A2. 
XX 

PD 09-AUG-2001. 
XX 

PF 30-JAN-2001; 2 0 OIWO-USOOO 669 . 
XX 

PR 04-FEB-2000; 2 0 OOUS- 0 18 03 12 P . 

PR 26-MAY-2000; 2000US-0207456P . 

PR 30-JUN-2000; 2000US-00608408 . 

PR 03-AUG-2000; 2000US-00632366 . 

PR 21-SEP-2000; 2 OOOUS-02 34 687P . 

PR 27-SEP-2000; 2000US-0236359P . 

PR 04-OCT-2000; 2 00 OGB-00 02 42 63 . 
XX 

PA (MOLE-) MOLECULAR DYNAMICS INC. 
XX 

PI Penn SG, Hanzel DK, Chen Rank DR; 
XX 

DR WPI; 2001-483447/52. 
XX 

PT Human genome-derived single exon nucleic acid probes useful for analyzing 

PT gene expression in human fetal liver. 

XX 

PS Claim 27; SEQ ID NO 29187; 639pp + Sequence Listing; English. 
XX 

CC The invention relates to a single exon nucleic acid probe for measuring 

CC human gene expression in a sample derived from human foetal liver. The 

CC single exon nucleic acid probes may be used for predicting, measuring and 

CC displaying gene expression in samples derived from human fetal liver. The 

CC present sequence is a peptide encoded by a single exon nucleic acid probe 

CC of the invention. Note: The sequence data for this patent did not form 

CC part of the printed specification, but was obtained in electronic format 

CC directly from WIPO at ftp.wipo.int/pub/published_pct_sequences 
XX 

SQ Sequence 132 AA; 

Query Match 92.9%; Score 39; DB 4; Length 132; 
Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I M I I 

Db 116 LQPPPLP 122 



RESULT 14 
AAM30051 

ID AAM30051 standard; protein; 132 AA. 
XX 

AC 7^AM30051; 
XX 

DT 17-OCT-2001 (first entry) 
XX 

DE Peptide #4088 encoded by probe for measuring placental gene expression. 
XX 

KW Probe; microarray; human; placenta; antenatal diagnosis; 

KW genetic disorder, 

XX 

OS Homo sapiens. 
XX 

PN WO200157272-A2. 
XX 

PD 09-AUG-2001. 
XX 

PF 30-JAN-2001; 2001WO-US000663 . 
XX 

PR 04-FEB-2000; 2000US-0180312P . 

PR 26-MAY-2000; 2000US-0207456P . 

PR 30-JUN-2000; 2000US-006084 08 . 

PR 03-AUG-2000; 2000US-00632366 . 

PR 21-SEP-2000; 2000US-0234 687P . 

PR 27-SEP-2000; 2000US-0236359P . 

PR 04-OCT-2000; 2000GB-00024263 . 
XX 

PA (MOLE-) MOLECULAR DYNAMICS INC. 
XX 

PI Penn SG, Hanzel DK, Chen Rank DR; 
XX 

DR WPI; 2001-488897/53. 
XX 

PT Human genome-derived single exon nucleic acid probes useful for analyzing 

PT gene expression in human placenta. 

XX 

PS Claim 27; SEQ ID NO 30320; 654pp; English. 
XX 

CC The present invention relates to single exon nucleic acid probes (SENP: 

CC see AAI31315-AAI57546) . The present sequence is a peptide encoded by one 

CC such probe. The probes are useful for producing a microarray for 

CC predicting, measuring and displaying gene expression in samples derived 

CC from human placenta. The probes are useful for antenatal diagnosis of 

CC human genetic disorders 

XX 

SQ Sequence 132 AA; 

Query Match 92.9%; Score 39; DB 4 ; Length 132; 
Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 



1 MQPPPLP 7 
: I I I I I I 



Db 



116 LQPPPLP 122 



RESULT 15 
ABB31347 

ID ABB31347 standard; peptide; 132 AA. 
XX 

AC ABB31347; 
XX 

DT Ol-FEB-2002 (first entry) 
XX 

DE Peptide #3998 encoded by breast cell single exon nucleic acid probe, 
XX 

KW Human; microarray; single exon probe; gene expression; breast; disease; 

KW cancer. 

XX 

OS Homo sapiens. 
XX 

PN WO200157271-A2. 
XX 

PD 09-AUG-2001. 
XX 

PF 30-J7i^-2001; 2 001WO-US000662 . 
XX 

PR 04-FEB-2000; 2 OOOUS-01 8 0312 P . 

PR 26-MAY-2000; 2000US-0207456P . 

PR 30-JUN-2000; 2000US-006084 08 . 

PR 03-AUG-2000; 2 OOOUS- 00632 366 . 

PR 21-SEP-2000; 2 OOOUS- 02 34 68 7 P . 

PR 27-SEP-2000; 2000US-0236359P . 

PR 04-OCT-2000; 2 00 OGB-000242 63 . 
XX 

PA (MOLE-) MOLECULAR DYNAMICS INC. 
XX 

PI Penn SG, Hanzel DK, Chen Rank DR; 
XX 

DR WPI; 2001-496933/54. 
XX 

PT New spatially-addressable set of single exon nucleic acid probes, useful 

PT for measuring gene expression in sample derived from human breast, 

PT comprises number of single exon nucleic acid probes. 
XX 

PS Claim 27; SEQ ID NO 14315; 327pp + Sequence Listing; English. 
XX 

CC The invention relates to a spatially-addressable set of single exon 

CC nucleic acid probes for measuring gene expression in a sample derived 

CC from human breast and BT 474 cells. The method involves contacting the 

CC probes with a collection of detectably labelled nucleic acids derived 

CC from mRNA of human breast, and then measuring the label bound to each 

CC probe of the microarray. The probes are useful for verifying the 

CC expression of regions of genomic DNA predicted to encode proteins. They 

CC are useful for gene discovery, and for determining predisposition and/or 

CC prognosing breast disease. Gene expression analysis is useful for 

CC assessing the toxicity of chemical agents on cells. The microarray of 

CC this invention presents a far greater diversity of probes for measuring 

CC gene expression, with far less bias than expressed sequence tag 

CC microarrays. The method is suitable for rapid production of functional 



CC information from genomic sequence. The present sequence is a peptide 

CC encoded by a single exon nucleic acid probe of the invention. Note: The 

CC sequence data for this patent did not form part of the printed 

CC specification, but was obtained in electronic format directly from WIPO 

CC at f tp . wipo , int/pub/published_pct_sequences 

XX 

SQ Sequence 132 AA; 

Query Match 92.9%; Score 39; DB 4; Length 132; 

Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 116 LQPPPLP 122 



Search completed: August 24, 2004, 15:42:03 
Job time : 33.5224 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on : 



August 24, 2004, 15:33:13 ; Search time 7.6791 Seconds 

(without alignments) 
, 47.060 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-09-641-801-1 
42 

1 MQPPPLP 7 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 389414 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Issued_Patents_AA: ^ 

1 : / cgn2_6/ptodata/2/iaa/5A__COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: ^ 

3 : / cgn2_6/ptodata/2/iaa/6A_COMB.pep: * 

4 : / cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep: * 

6: / cgn2_6/ptodata/2/iaa/backfilesl.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


42 


100, 


0 


7 


4 


US-09-641-803-1 


Sequence 


1, Appli 


2 


37 


88 . 


1 


278 


4 


US-08-339-214-16 


Sequence 


16, Appl 


3 


37 


88. 


1 


278 


4 


US-08-339-214-26 


Sequence 


26, Appl 


4 


37 


88. 


1 


278 


4 


US-09-131-237C-6 


Sequence 


6, Appli 


5 


37 


88. 


1 


479 


4 


US-09-2 52-99 lA-31536 


Sequence 


31536, A 


6 


37 


88. 


1 


642 


4 


US-09-252-991A-32104 


Sequence 


32104, A 


7 


37 


88. 


1 


667 


2 


US-08-718-661-2 


Sequence 


2, Appli 


8 


37 


88. 


1 


721 


4 


US-09-533-029-78 


Sequence 


7 8, Appl 


9 


36 


85. 


7 


349 


1 


US-08-118-270-71 


Sequence 


71, Appl 


10 


36 


85. 


7 


349 


5 


PCT-US93-08528-71 


Sequence 


71, Appl 


11 


35 


83. 


3 


23 


1 


US-08-268-251-34 


Sequence 


34, Appl 



12 


35 


83 


. 3 


23 


5 


PCT~US93- 


01112-34 


Sequence 


34, Appl 


13 


35 


83 


.3 
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4 


US- 


-09 


-388 


-349-9 


Sequence 


9, Appli 


14 


34 


81 


0 


76 


4 


US- 


-09 


-096 


-731A-14 


Sequence 


14, Appl 


15 


34 


81 


0 
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4 


us- 


-09 


-252 


-991A-24956 


Sequence 


24956, A 


16 


34 


81 


0 


194 


3 


us- 


-08 


-822 


-264-4 


Sequence 


4, Appli 


17 


34 


81 


0 


194 


4 


us- 


-09 


-565 


-808-21 


Sequence 


21, Appl 


18 


34 


81. 


0 


195 


2 


us- 


-08 


-960 


-022-6 


Sequence 


6, Appli 


19 


34 


81. 


0 


195 


4 


us- 


-09 


-565 


-808-2 


Sequence 


2, Appli 


20 


34 


81. 


0 


223 


3 


us- 


-08 


-822 


-264-3 


Sequence 


3, Appli 


21 


34 


81. 


0 


223 


4 


us- 


-09 


-565 


-808-22 


Sequence 


22, Appl 


22 


34 


81. 


0 


235 


4 


us- 


-09 


-370 


-950C-2 


Sequence 


2, Appli 


23 


34 


81. 


0 


236 


4 


us- 


-09 


-370 


-950C-3 


Sequence 


3, Appli 


24 


34 


81. 


0 
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4 


us- 


-09- 


-252 


-991A-20981 


Sequence 


20981, A 


25 


34 


81. 


0 


258 


4 


us- 


-09- 


-096 


-731A-18 


Sequence 


18, Appl 


26 


34 


81. 


0 


304 


4 


us- 


-09- 


-096- 


-731A-10 


Sequence 


10, Appl 


27 


34 


81. 


0 


309 


4 


us- 


-09- 


-489- 


-039A-11604 


Sequence 


11604, A 


28 


34 


81. 


0 


376 


4 


us- 


■09- 


-056- 


-556-202 


Sequence 


202, App 


29 


34 


81. 


0 


376 


4 


us- 


-09- 


-072- 


-596-197 


Sequence 


197, App 


30 


34 


81. 


0 


376 


4 


us- 


09- 


-072- 


-967-202 


Sequence 


202, App 


31 


34 


81. 


0 


409 


4 


us- 


09- 


-029- 


-755C-2 


Sequence 


2, Appli 


32 


34 


81. 


0 


608 


3 


us- 


09- 


-413- 


-814-92 


Sequence 


92, Appl 


33 


34 


81. 


0 


690 


4 


us- 


09- 


-252- 


-991A-16715 


Sequence 


16715, A 


34 


34 


81. 


0 


1006 


4 


us- 


09- 


-023- 


-905A-12 


Sequence 


12, Appl 


35 


34 


81. 


0 


1182 


4 


us- 


09- 


-293- 


-505-7 


Sequence 


7, Appli 


36 


34 


81. 


0 


1213 


3 


us- 


09- 


-413- 


-814-79 


Sequence 


79, Appl 


37 


33 


78. 


6 


10 


3 


us- 


08- 


-602- 


-999A-278 


Sequence 


278, App 


38 


33 


78. 


6 


10 


4 


us- 


09- 


-500- 


-124-278 


Sequence 


278, App 


39 


33 


78. 


6 


65 


4 


us- 


09- 


-621- 


-976-6387 


Sequence 


6387, Ap 


40 


33 


78. 


6 


95 


4 


us- 


09- 


-314- 


-268-132 


Sequence 


132, App 


4 1 


33 


78 . 


6 


96 


4 


us- 


09- 


-800- 


-729-179 


Sequence 


179, App 


42 


33 


78. 


6 


112 


4 


us- 


09- 


-621- 


-976-6045 


Sequence 


6045, Ap 


43 


33 


78. 


6 


146 


4 


us- 


09- 


-252- 


-991A-24993 


Sequence 


24993, A 


44 


33 


78. 


6 


207 


4 


us- 


09- 


-252- 


-991A-29063 


Sequence 


29063, A 


45 


33 


78. 


6 


324 


4 


us- 


08- 


-732- 


-749-4 


Sequence 


4, Appli 



ALIGNMENTS 



RESULT 1 
US-09-641-803-1 

; Sequence 1, Application US/09641803 

; Patent No. 6500798 

; GENERAL INFORMATION: 

; APPLICANT: STANTON, G. John 

; APPLICANT: HUGHES, Thomas K. 

; APPLICANT: BOLDOGH, Istvan 

; TITLE OF INVENTION: USE OF COLOSTRININ, CONSTITUENT PEPTIDES THEREOF, AND 

; TITLE OF INVENTION: ANALOGS THEREOF AS OXIDATIVE STRESS REGULATORS 

; FILE REFERENCE: 265.00220101 

; CURRENT APPLICATION NUMBER: US/09/641,803 

; CURRENT FILING DATE: 2000-08-17 

; PRIOR APPLICATION NUMBER: 60/149,310 

; PRIOR FILING DATE: 1999-08-17 

; NUMBER OF SEQ ID NOS : 34 

; SOFTWARE: Patentin Ver. 2.1 

; SEQ ID NO 1 



LENGTH: 7 
; TYPE: PRT 

ORGANISM: Artificial Sequence 
; FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: synthetic 
OTHER INFORMATION: peptide 
US-09-641-803-1 



Query Match 100.0%; Score 42; DB 4; Length 7; 

Best Local Similarity 100.0%; Pred. No. 3e+05; 

Matches 7 ; Conservative 0 ; Mismatches 0 ; Indels 0 ; Gaps 0 ; 

Qy I MQPPPLP 7 

I I I I I I I 
Db 1 MQPPPLP 7 



RESULT 2 

US-08-339-214-16 

Sequence 16, Application US/08339214 
Patent No. 6348334 
GENERAL INFORMATION: 

APPLICANT: Nagata, Shigikazu 
APPLICANT: Suda, Takashi 
APPLICANT: Takahashi, Tomoniro 
APPLICANT: Nakamura, No. 6348334io 

TITLE OF INVENTION: A Fas Ligand, A Fragment Thereof and DNA 
TITLE OF INVENTION: Encoding the Same 
NUMBER OF SEQUENCES: 103 
CORRESPONDENCE ADDRESS; 

ADDRESSEE: Birch, Stewart, Kolasch & Birch 
STREET: P.O. Box 747 
CITY: Falls Church 
STATE: Virginia 
COUNTRY: USA 
ZIP: 22040-0747 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 0 8 /339 , 2 14 
FILING DATE: lO-NOV-1994 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Murphy Jr., Gerald M. 
REGISTRATION NUMBER: 28,977 
REFERENCE/DOCKET NUMBER: 1110-139P 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 703-205-8 000 
TELEFAX: 7 03-2 05-8 050 
INFORMATION FOR SEQ ID NO: 16: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 27 8 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 



MOLECULE TYPE: protein 
US-08-339-214-16 



Query Match 88.1%; Score 37; DB 4; Length 278; 

Best Local Similarity 100.0%; Pred. No. 1.8e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

M I I I I 

Db 60 QPPPLP 65 



RESULT 3 

US-08-339-214-26 

Sequence 26, Application US/08339214 
Patent No. 6348334 
GENERAL INFORMATION: 

APPLICANT: Nagata, Shigikazu 
APPLICANT: Suda, Takashi 
APPLICANT: Takahashi, Tomoniro 
APPLICANT: Nakamura, No. 6348334io 

TITLE OF INVENTION: A Fas Ligand, A Fragment Thereof and DNA 
TITLE OF INVENTION: Encoding the Same 
NUMBER OF SEQUENCES: 103 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Birch, Stewart, Kolasch & Birch 
STREET: P.O. Box 747 
CITY: Falls Church 
STATE: Virginia 
COUNTRY: USA 
ZIP: 22040-0747 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentin Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/339, 214 
FILING DATE: lO-NOV-1994 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
NAME: Murphy Jr., Gerald M. 
REGISTRATION NUMBER: 28,977 
REFERENCE/DOCKET NUMBER: 1110-139P 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 703-205-8 000 
TELEFAX: 703-205-8 05 0 
INFORMATION FOR SEQ ID NO: 26: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 27 8 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
US-08-339-214-26 



Query Match 88.1%; Score 37; DB 4; Length 278; 

Best Local Similarity 100.0%; Pred. No. 1.8e+02; 



Matches 



6; 



Conservative 



0; Mismatches 



0; Indels 



0; Gaps 



Qy 



2 QPPPLP 7 



Db 



60 QPPPLP 65 



RESULT 4 

US-09-1-31-237C-6 

; Sequence 6, Application US/09131237C 

; Patent No. 6599719 

; GENERAL INFORMATION: 

; APPLICANT: Yu, Guo-Liang 

; APPLICANT: Ni, Jian 

; APPLICANT: Rosen, Craig A. 

; TITLE OF INVENTION: Tumor Necrosis Factor Gamma 
; FILE REFERENCE: PF141P3 

; CURRENT APPLICATION NUMBER: US/09/131 , 237C 

; CURRENT FILING DATE: 1998-08-07 

; PRIOR APPLICATION NUMBER: 60/074,047 

; PRIOR FILING DATE: 1998-02-09 

; PRIOR APPLICATION NUMBER: 09/005,020 

; PRIOR FILING DATE: 1998-01-09 

; PRIOR APPLICATION NUMBER: 08/461,246 

; PRIOR FILING DATE: 1995-06-05 

PRIOR APPLICATION NUMBER: PCT/US94/ 12 8 8 0 
; PRIOR FILING DATE: 1994-11-07 

NUMBER OF SEQ ID NOS : 24 

SOFTWARE: Patentin Ver. 2.0 
; SEQ ID NO 6 

LENGTH: 278 
TYPE: PRT 

ORGANISM: Rattus norvegicus 
US-09-131-237C-6 

Query Match 88.1%; Score 37; DB 4; Length 278; 

Best Local Similarity 100.0%; Pred. No. 1.8e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 QPPPLP 7 

I I I I I I 

Db 60 QPPPLP 65 



US-09-252-991A-31536 

; Sequence 31536, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/ 0 9/2 52 , 99 lA 

; CURRENT FILING DATE: 1999-02-18 

; PRIOR APPLICATION NUMBER: US 60/074,788 



RESULT 5 



PRIOR FILING DATE: 1998-02-18 

PRIOR APPLICATION NUMBER: US 60/094,190 
; PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 31536 
; LENGTH: 479 
; TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-31536 

Query Match 88.1%; Score 37; DB 4; Length 479; 

Best Local Similarity 100.0%; Pred. No. 3.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 QPPPLP 7 

I I I I I I 

Db 193 QPPPLP 198 



RESULT 6 

US-09-252-991A-32104 

; Sequence 32104, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

CURRENT APPLICATION NUMBER: US/ 0 9/252 , 9 91A 
; CURRENT FILING DATE: 1999-02-18 
; PRIOR APPLICATION NUMBER: US 60/074,788 

PRIOR FILING DATE: 1998-02-18 

PRIOR APPLICATION NUMBER: US 60/094,190 
; PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS: 33142 
; SEQ ID NO 32104 
; LENGTH: 642 
TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-32104 

Query Match 8 8.1%; Score 37; DB 4; Length 642; 

Best Local Similarity 100.0%; Pred. No. 4.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 QPPPLP 7 

I I M I I 

Db 41 QPPPLP 46 



RESULT 7 
US-08-718-661-2 

; Sequence 2, Application US/08718661 
; Patent No. 5876972 
; GENERAL INFORMATION: 
APPLICANT: 



TITLE OF INVENTION: Nucleic acid molecules coding for mammalian 
TITLE OF INVENTION: tumor suppressor proteins and methods for their 
isolation 

; NUMBER OF SEQUENCES: 15 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO) 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/ 08/7 1 8 , 661 

FILING DATE: 
CLASSIFICATION: 530 
; INFORMATION FOR SEQ ID NO: 2: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 667 amino acids 
; TYPE: amino acid 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
US-08-718-661-2 

Query Match 88.1%; Score 37; DB 2; Length 667; 

Best Local Similarity 100.0%; Pred, No. 4.3e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 527 QPPPLP 532 



RESULT 8 

US-09-533-029-78 

; Sequence 78, Application US/09533029 
; Patent No. 6664446 
; GENER7VL INFORMATION: 



APPLICANT: 


Heard, 


Jacqueline 


APPLICANT: 


Broun, 


Pierre 


APPLICANT: 


Riechmann, Jose-Luis 


APPLICANT: 


Keddie, 


James 


APPLICANT: 


Pineda, 


Omaira 


APPLICTi^T: 


Adam, Luc 


APPLICANT: 


Samaha, 


Raymond 


APPLICANT: 


Zhang, 


James 


APPLICANT: 


Yu, Guo 


-Liang 


APPLICANT: 


Ratclif 


fe, Oliver 


APPLICANT: 


Pilgrim 


, Marsha 


APPLICANT: 


Jiang, 


Cai-Zhong 


APPLICANT: 


Reuber , 


Lynne 



TITLE OF INVENTION: DISEASE-INDUCED POLYNUCLEOTIDES 
FILE REFERENCE: MBI-010 

CURRENT APPLICATION NUMBER: US/09/533,029 
CURRENT FILING DATE: 2000-03-22 
EARLIER APPLICATION NUMBER: 60/125,814 
EARLIER FILING DATE: 1999-03-23 
NUMBER OF SEQ ID NOS : 121 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 78 



LENGTH: 721 
TYPE: PRT 

ORGANISM: Arabidopsis thaliana 
; FEATURE: 

OTHER INFORMATION: G3 8 5 
US-09-533-029-78 



Query Match 88.1%; Score 37; DB 4; Length 721; 

Best Local Similarity 85.7%; Pred. No. 4.6e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I I 

Db 205 MSPPPLP 211 



RESULT 9 

US-08-118-270-71 

; Sequence 71, Application US/08118270 

; Patent No. 5508384 

; GENERAL INFORMATION: 

APPLICANT: Murphy, Randall B. 

APPLICANT: Schuster, David I. 

TITLE OF INVENTION: POLYPEPTIDES OF G-COUPLED PROTEIN 

TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND METHODS THEREOF 
NUMBER OF SEQUENCES: 348 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BROWDY AND NEIMARK 
STREET: 419 Seventh Street, N.W., Suite 300 
CITY: Washington 
STATE: D.C. 
COUNTRY: USA 
ZIP: 20004 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08/ 118 , 27 0 
FILING DATE: 09-SEP-1993 
PRIOR APPLICATION DATA: 
; APPLICATION NUMBER: US 07/943,236 

FILING DATE: lO-SEP-1992 
ATTORNEY/AGENT INFORMATION: 
NAME: Townsend, Kevin G. 
REGISTRATION NUMBER: 34,033 
REFERENCE/DOCKET NUMBER: MURPHY=2A 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-628-5197 
TELEFAX: 202-737-3528 
; TELEX: 248633 

; INFORMATION FOR SEQ ID NO: 71: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 34 9 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: single 



TOPOLOGY: linear 
MOLECULE TYPE: peptide 
US-08-118-270-71 



Query Match 85.7%; Score 36; DB 1; Length 349; 

Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

Mill I 

Db 290 MQPPPKP 296 



RESULT 10 
PCT-US93-08528-71 

; Sequence 71, Application PC/TUS9308528 
; GENERAL INFORMATION: 

APPLICANT: New York University 
; TITLE OF INVENTION: POLYPEPTIDES OF G-COUPLED PROTEIN 

TITLE OF INVENTION: RECEPTORS, AND COMPOSITIONS AND METHODS THEREOF 
NUMBER OF SEQUENCES: 34 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: BROWDY AND NEIMARK 
; STREET: 419 Seventh Street, N.W., Suite 300 

CITY: Washington 
STATE: D.C. 
COUNTRY: USA 
ZIP: 20004 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentin Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: PCT/US93/ 08528 

FILING DATE: 09-SEP-1993 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/943,236 
FILING DATE: lO-SEP-1992 
; ATTORNEY/AGENT INFORMATION: 
NAME: Townsend, Kevin G. 
REGISTRATION NUMBER: 34,033 
REFERENCE/DOCKET NUMBER: MURPHY=2 PCT 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-628-5197 
TELEFAX: 202-737-3528 
TELEX: 248633 
INFORMATION FOR SEQ ID NO: 71; 
SEQUENCE CHARACTERISTICS: 
LENGTH: 34 9 amino acids 
; TYPE: amino acid 

STRANDEDNESS: single 
TOPOLOGY: linear 
MOLECULE TYPE: peptide 
PCT-US93-08528-71 



Query Match 



85.7%; Score 36; DB 5; Length 349; 



Qy 1 MQPPPLP 7 

1 1 1 1 1 I 

Db 16 MQPPPPP 22 



RESULT 12 
PCT-US93-01112-34 

; Sequence 34, Application PC/TUS9301112 
; GENERAL INFORMATION: 

APPLICANT: Jamieson, Gordon A 
APPLICANT: Dedman, John R 
; APPLICANT: Kaetzel, Marcia A 

TITLE OF INVENTION: Calmodulin-Binding Peptides 
; NUMBER OF SEQUENCES: 59 

CORRESPONDENCE ADDRESS: 
; ADDRESSEE: Chiron Corporation 

; STREET: 4560 Horton Street 

; CITY: Emeryville 

; STATE: CA 

COUNTRY: USA 
ZIP: 94608 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: WordPerfect 5.1 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US93/ 01112 
FILING DATE: 19930208 
; CLASSIFICATION: 

PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/831,219 
; FILING DATE: 06-FEB-1992 

; ATTORNEY/AGENT INFORMATION: 

NAME: Green, Grant D 
REGISTRATION NUMBER: 31,259 
REFERENCE/DOCKET NUMBER: 272.001 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (510) 601-2706 

TELEFAX: (510) 655-3542 
INFORMATION FOR SEQ ID NO: 34: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 23 amino acids 
TYPE: 7\MIN0 ACID 
STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
PCT-US93-01112-34 



Query Match 83.3%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 

Qy 1 MQPPPLP 7 

I I I I I I 
Db 16 MQPPPPP 22 



Score 35; DB 5; Length 23; 
Pred. No. 32; 
0; Mismatches 1; Indels 0; Gaps 0; 



Best Local Similarity 85.7%; Pred. No. 3.1e+02; 

Matches 6; Conservative 0; Mismatches 1; Indels 



0; Gaps 



0; 



Qy 1 MQPPPLP 7 

I I I I I I 

Db 290 MQPPPKP 296 



RESULT 11 
US-08-268-251-34 

; Sequence 34, Application US/08268251 
; Patent No. 5585475 

GENERAL INFORMATION: 

APPLICANT: Jamieson, Gordon A 
APPLICANT: Dedman, John R 
APPLICANT: Kaetzel, Marcia A 
; TITLE OF INVENTION: Calmodulin-Binding Peptides 

NUMBER OF SEQUENCES: 59 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Chiron Corporation 
; STREET: 4560 Horton Street 

; CITY: Emeryville 

STATE: CA 
; COUNTRY: USA 

ZIP: 94608 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: WordPerfect 5.1 
CURRENT APPLICATION DATA: 
; APPLICATION NUMBER: US/08/2 68 , 251 

; FILING DATE: 

CLASSIFICATION: 514 
; PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US 07/831,219 
FILING DATE: 0 6-FEB-1992 
ATTORNEY/AGENT INFORMATION: 
NAME: Green, Grant D 
REGISTRATION NUMBER: 31,259 
REFERENCE/DOCKET NUMBER: 272.001 
TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (510) 601-2706 

TELEFAX: (510) 655-3542 
INFORMATION FOR SEQ ID NO: 34: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 23 amino acids 
TYPE: amino acid 
STRANDEDNESS: single 
; TOPOLOGY: linear 

MOLECULE TYPE: peptide 
US-08-268-251-34 

Query Match 83.3%; Score 35; DB 1; Length 23; 

Best Local Similarity 85.7%; Pred. No. 32; 

Matches 6; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 



RESULT 13 
US-09-388-349-9 

; Sequence 9, Application US/09388349 

; Patent No. 6365370 

; GENERAL INFORMATION: 

; APPLICiWT: Dubin, Adreinne E 

; APPLICANT: Erlander, Mark G 

; APPLICANT: Huvar, Arne 

; APPLICANT: Huvar, Rene 

; APPLICANT: Buehler, Lukas K 

; TITLE OF INVENTION: DNA Encoding A Human Subunit 5-HT3-C of the 5-HT3 

Serotonin Receptor 

; FILE REFERENCE; ORT-103 9 

; CURRENT APPLICATION NUMBER: US/09/38 8 , 349 
CURRENT FILING DATE: 1999-09-01 
NUMBER OF SEQ ID NOS : 12 
SOFTWARE: PatentlnVer, 2.0 
; SEQ ID NO 9 

LENGTH: 447 
TYPE: PRT 

ORGTiJ^ISM: Homo sapiens 
US-09-388-349-9 



Query Match 83.3%; Score 35; DB 4; Length 447; 

Best Local Similarity 83.3%; Pred. No. 5.5e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I : I 

Db 339 QPPPMP 344 



RESULT 14 
US-09-096-731A-14 

; Sequence 14^ Application US/09096731A 

; Patent No. 6489454 

; GENERAL INFORMATION: 

; APPLICANT: Liu, Qiong 

; APPLICJWT: Hengartner, Michael O. 

; APPLICANT: Bogaert, Thierry Andre Oliver Eddy 

; APPLICANT: Van Criekinge, Wim Maria Rene 

TITLE OF INVENTION: ENGULFMENT GENE 7\ND USES THEREOF 

FILE REFERENCE: CSHL97-07pA 
; CURRENT APPLICATION NUMBER: US/09/096, 731A 

CURRENT FILING DATE: 1998-06-11 

PRIOR APPLICATION NUMBER: US 60/072,324 
; PRIOR FILING DATE: 1998-01-23 
; NUMBER OF SEQ ID NOS: 4 4 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 14 

LENGTH: 7 6 

TYPE: PRT 
; ORGANISM: Human 
; FEATURE: 

OTHER INFORMATION: Proline/Serine Rich Region 
US-09-096-731A-14 



Query Match 81.0%; Score 34; DB 4; Length 76; 

Best Local Similarity 83.3%; Pred. No. 1.4e+02; 

Matches 5; Conservative 1; Mismatches 0; Indels 0; Gaps 

Qy 2 QPPPLP 7 

I I I I : I 

Db 38 QPPPVP 4 3 



RESULT 15 

US-09-252-991A-24 956 

; Sequence 24956, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al . 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
PSEUDOMONAS 

; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/ 09/252 , 991A 

; CURRENT FILING DATE: 1999-02-18 

; PRIOR APPLICATION NUMBER: US 60/074,788 

; PRIOR FILING DATE: 1998-02-18 

; PRIOR APPLICATION NUMBER: US 60/094,190 

PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 24956 

LENGTH: 158 
; TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-24 956 

Query Match 81.0%; Score 34; DB 4; Length 158; 

Best Local Similarity 71.4%; Pred. No. 2.8e+02; 

Matches 5; Conservative 1; Mismatches 1; Indels 0; Gaps 

Qy 1 MQPPPLP 7 

: Mill 
Db 136 LSPPPLP 142 



Search completed: August 24, 2004, 15:55:10 
Job time : 9.6791 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search^ using sw model 



Run on: August 24, 2004, 15:26:28 ; Search time 6.79105 Seconds 

(without alignments) 
99.151 Million cell updates/sec 

Title: US-09-641-8 01-1 

Perfect score: 42 
Sequence: 1 MQPPPLP 7 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 283366 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIR_78:* 
1: pirl:* 
2: pir2:^ 
3: pir3:* 
4: pir4:+ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
G86280 

protein T5E21.13 [imported] - Arabidopsis thaliana 
C;Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 31-Mar-2001 
C; Access ion: G8 628 0 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; 
Alonso, J.; Altaf, H.; Araujo, R. ; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; 
Chan, A.; Chao, Q.; Chen, H. ; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; 
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K. ; Dunn, P.; Etgu, P.; 
Feldblyum, T.V.; Feng, J.; Fong, B. ; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.; 
Haas, B. ; Hansen, N. F. ; Hughes, B. ; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C; Khan, S,; Khaykin, E.; 
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin- 
Hooper, S.; Lee, A.; Lee, J.M. ; Lenz, C.A.; Li, J.H.; Li, Y. ; Lin, X.; Liu, 
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R. ; Marziali, A.; Militscher, J.; Miranda, 
M.; Nguyen, M. ; Nierman, W.C.; Osborne, B.I.; Pai, G.; Peterson, J.; Pham, P.K.; 
Rizzo, M. ; Rooney, T.; Rowley, D.; Sakano, H. 



A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; 
Tallon, L.J.; Tambunga, G. ; Toriumi, M.J.; Town, CD.; Utterback, T, ; van Aken, 
S.; Vaysberg, M. ; Vysotskaia, V.S.; Walker, M. ; Wu, D. ; Yu, G. ; Fraser, CM.; 
Venter, J.C; Davis, R.W. 

A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis . 

A; Reference number: A86141; MUID : 21016719 ; PMID : 111307 12 

A; Accession: G8 62 8 0 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-1776 <STO> 

A;Cross-references : GB:AE005172; NID: g7527720 ; PIDN : AAF63169 . 1 ; GSPDB : GN0014 1 
C; Genetics : 
A;Gene: T5E21.13 
A;Map position: 1 



Query Match 95.2%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 40; DB 2; Length 1776; 
Pred. No. 2.1e+02; 
1; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 1 MQPPPLP 7 

I I I I I : I 

Db 1643 MQPPPMP 1649 



RESULT 2 
E82756 

beta-galactosidase XF0840 [imported] - Xylella fastidiosa (strain 9a5c) 
C; Species: Xylella fastidiosa 

C;Date: 18-Aug-2000 #sequence__revision 20-Aug-2000 #text_change 02-Sep-2000 
C; Accession: E827 5 6 

R; anonymous, The Xylella fastidiosa Consortium of the Organization for 
Nucleotide Sequencing and Analysis, Sao Paulo, Brazil. 
Nature 406, 151-157, 2000 

A;Title: The genome sequence of the plant pathogen Xylella fastidiosa. 
A; Reference number: A82515; MUID: 20365717 ; PMID : 10910347 

A;Note: for a complete list of authors see reference number A59328 below 
A; Access ion: E82756 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-612 <SIM> 

A;Cross-references: GB:AE003923; GB:AE003849; NID : g9105736 ; PIDN : AAF83650 . 1 ; 

GSPDB:GN00128; XFSC:XF0840 

A; Experimental source: strain 9a5c 

R;Simpson, A.J.G.; Reinach, F.C; Arruda, P.; i^breu, F.A. ; Acencio, M. ; 
Alvarenga, R. ; Alves, L.M.C; Araya, J.E.; Baia, G.S.; Baptista, CS.; Barros, 
M.H.; Bonaccorsi, E.D.; Bordin, S.; Bove, J.M.; Briones, M.R.S.; Bueno, M.R.P.; 
Camargo, A.A. ; Camargo, L.E.A.; Carraro, D.M. ; Carrer, H. ; Colauto, N.B.; 
Colombo, C; Costa, F.F.; Costa, M.CR.; Costa-Neto, CM.; Coutinho, L.L.; 
Cristofani, M. ; Dias-Neto, E. ; Docena, C; El-Dorry, H.; Facincani, A. P.; 
Ferreira, A.J.S. 

submitted to GenBank, June 2000 

A;Authors: Ferreira, V.C.A. ; Ferro, J.A. ; Fraga, J.S.; Franca, S.C; Franco, 
M.C; Frohme, M. ; Furlan, L.R.; Garnier, M. ; Goldman, G.H.; Goldman, M.H.S.; 
Gomes, S.L.; Gruber, A.; Ho, P.L.; Hoheisel, J.D.; Junqueira, M.L.; Kemper, 
E.L.; Kitajima, J. P.; Krieger, J.E.; Kuramae, E.E.; Laigret, F. ; Lambais, M.R.; 
Leite, L.C.C; Lemos, E.G.M.; Lemos, M.V.F.; Lopes, S.A.; Lopes, C.R.; Machado, 



J.A. ; Machado, M.A. ; Madeira, A.M.B.N.; Madeira, H.M.F.; Marino, C.L.; Marques, 
M.V. ; Martins, E.A.L. 

A;Authors: Martins, E.M.F.; Matsukuma, A.Y.; Menck, C.F.M.; Miracca, E.G.; 
Miyaki, C.Y.; Monteiro-Vitorello, C.B.; Moon, D.H.; Nagai, M.A. ; Nascimento, 

A. L.T.O.; Netto, L.E.S.; Nhani Jr., A.; Nobrega, F.G.; Nunes, L.R.; Oliveira, 
M.A.; de Oliveira, M.C.; de Oliveira, R.C.; Palmieri, D.A. ; Paris, A.; Peixoto, 

B. R.; Pereira, G.A.G.; Pereira Jr., H.A. ; Pesquero, J.B.; Quaggio, R.B.; 
Roberto, P.G.; Rodrigues, V.; Rosa, A.J. de M. ; de Rosa Jr., V.E.; de Sa, R.G.; 
Santelli, R.V. ; Sawasaki, H.E. 

A;Authors: da Silva, A.C.R.; da Silva, F.R.; da Silva, A.M.; Silva Jr., W.A. ; da 

Silveira, J.F.; Silvestri, M.L.Z.; Siqueira, W.J.; de Souza, A.A. ; de Souza, 

A. P.; Terenzi, M.F.; Truffi, D . ; Tsai, S.M.; Tsuhako, M.H.; Vallada, H. ; Van 

Sluys, M.A.; Ver jovski-Almeida, S.; Vettore, A.L.; Zago, M.A. ; Zatz, M. ; 

Meidanis, J.; Setubal, J.C. 

A; Reference number: A59328 

A; Contents: annotation 

C; Genetics : 

A; Gene: XF0840 

C; Superf amily : beta-galactosidase bga 

Query Match 92.9%; Score 39; DB 2; Length 612; 

Best Local Similarity 85.7%; Pred. No. 98; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I M 
Db 356 LQPPPLP 362 



RESULT 3 
T02441 

DNA- (apurinic or apyrimidinic site) lyase (ARP) [imported] - Arabidopsis 
thaliana 

N; Alternate names: hypothetical protein T26J13.5 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 05-Mar-1999 #sequence_revision 05-Mar-1999 #text_change 02-Feb-2001 
C;Accession: T02441; A84842 

R;Rounsley, S.D.; Lin, X.; Ketchum, K.A. ; Crosby, M.L.; Brandon, R.C.; Sykes, 
S.M.; Kaul, S.; Mason, T.M.; Kerlavage, A.R.; Adams, M.D.; Somerville, C.R.; 
Venter, J.C. 

submitted to the EMBL Data Library, June 1998 

A; Description : Arabidopsis thaliana chromosome II BAC T26J13 genomic sequence. 
A; Reference number: Z 14 673 
A;Accession: T02441 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type : DNA 
A; Residues: 1-536 <ROU> 

A;Cross-references : EMBL: AC004 625 ; NID : g324 1939 ; PID:g3241944 
A; Experimental source: cultivar Columbia 

R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, CD.; 
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V,; Buell, 
C.R.; Ketchum, K.A. ; Lee, J. J.; Ronning, CM.; Koo, H.; Moffat, K.S.; Cronin, 
L.A.; Shen, M. ; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams, 
M.D.; Carrera, A. J.; Creasy, T.H.; Goodman, H.M.; Somerville, C.R.; Copenhaver, 
G.P.; Preuss, D. ; Nierman, W.C; White, O.; Eisen, J.A. ; Salzberg, S.L.; Eraser, 
CM.; Venter, J.C. 
Nature 402, 761-768, 1999 



A; Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis 
thaliana . 

A;Reference number: A84420; MUID : 200834 87 ; PMID : 10617 197 
A;Accession: A84842 
A; Status : preliminary 
A;Molecule type: DNA 
A;Residues: 1-536 <STO> 

A;Cross-references: GB:AE002093; NID : g3241 94 4 ; PIDN : AAC23731 . 1; GSPDB : GN00139 
C; Genetics : 

A;Gene: At2g41460; T26J13.5 
A;Map position: 2 

A;Introns: 18/2; 78/3; 110/2; 137/1; 210/1; 317/3; 354/3; 400/3; 416/3; 443/3; 
497/1 

Query Match 90.5%; Score 38; DB 2; Length 536; 

Best Local Similarity 85.7%; Fred. No. 1.2e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I : I I M I 

Db 264 MRPPPLP 270 



RESULT 4 
H86239 

protein F20B24.8 [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence__revision 02-Mar-2001 #text_change 31-Mar-2001 
C;Accession: H86239 

R;Theologis, A.; Ecker, J.R. ; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, 0.; 
Alonso, J.; Altaf, H. ; Araujo, R. ; Bowman, C.L,; Brooks, S.Y.; Buehler, E. ; 
Chan, A.; Chao, Q. ; Chen, H. ; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; 
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K. ; Dunn, P.; Etgu, P.; 
Feldblyum, T.V.; Feng, J.; Fong, B. ; Fujii, G.Y.; Gill, J.E.; Goldsmith, A.D.; 
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C; Khan, S.; Khaykin, E.; 
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin- 
Hooper, S.; Lee, A.; Lee, J.M. ; Lenz, C.A. ; Li, J.H.; Li, Y.; Lin, X.; Liu, 
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R. ; Marziali, A.; Militscher, J.; Miranda, 
M.; Nguyen, M. ; Nierman, W.C.; Osborne, B.I.; Pal, G.; Peterson, J.; Pham, P.K.; 
Rizzo, M. ; Rooney, T.; Rowley, D. ; Sakano, H. 

A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H. ; 
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, CD.; Utterback, T.; van Aken, 
S.; Vaysberg, M. ; Vysotskaia, V.S.; Walker, M. ; Wu, D.; Yu, G. ; Eraser, CM.; 
Venter, J.C; Davis, R.W. 

A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis. 

A;Reference number: A86141; MUID : 210167 19 ; PMID : 11130712 

A; Accession: H8 6239 

A; Status: preliminary 

A;Molecule type: DNA 

A;Residues: 1-542 <STO> 

A;Cross-references: GB:AE005172; NID : g6573750 ; PIDN :AAF17 670 . 1 ; GSPDB : GN0014 1 
C; Genetics : 
A;Gene: F20B24.8 
A;Map position: 1 



Query Match 90.5%; Score 38; DB 2; Length 542; 

Best Local Similarity 85.7%; Pred. No. l,2e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0 



Qy 1 MQPPPLP 7 

: I I I I I I 
Db 105 VQPPPLP 111 



RESULT 5 
T15525 

hypothetical protein C16B8.3 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 20-Sep-1999 #sequence_revision 20-Sep-1999 #text_change 21-Jan-2000 
C;Accession: T15525 
R;Bentley, D. 

submitted to the EMBL Data Library, November 1995 

A; Description : The sequence of C. elegans cosmid C16B8. 

A;Reference number: Z18365 

A; Accession : T 15 52 5 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-164 <BEN> 

A;Cross-references: EMBL:U41031; NID : gl098982 ; PID: gl098985; PIDN : AAAB2 620 . 1 ; 

CESP:C16B8 .3 

C; Genetics : 

A;Gene: CESP:C16B8.3 

C; Superf amily : proline-rich protein 

Query Match 88.1%; Score 37; DB 2; Length 164; 

Best Local Similarity 100.0%; Pred. No. 52; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 2 QPPPLP 7 

I I I I I I 

Db 97 QPPPLP 102 



RESULT 6 
T00644 

hypothetical protein F3I6.7 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: Ol-Feb-1999 #sequence_revision Ol-Feb-1999 #text_change 22-Oct-1999 
C; Accession: TO 064 4 

R; Federspiel, N.A. ; Palm, C.J.; Conway, A.B.; Kurtz, D.B.; Conway, A.R.; Au, M 
Araujo, R. ; Buehler, E.; Dewar, K. ; Feng, J.; Kim, C. ; Li, Y. ; Oji, O. ; Osborn 
B.I.; Shinn, P.; Sun, H. ; Toriumi, M. ; Vysotskaia, V.S.; Yu, G. ; Ecker, J.; 
Theologis, A.; Davis, R.W. 

submitted to the EMBL Data Library, February 1998 
A; Reference number: Z14197 
A; Accession: TO 064 4 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-178 <FED> 

A;Cross-references: EMBL : AC002396 ; NID : g2749918 ; PID: g2829866; GSPDB : GN00059 ; 

ATSP:F3I6.7 

C; Genetics : 



A; Gene: ATSP:F3I6.7 
A;Map position: 1 
A;Introns: 50/3; 87/1 

C; Super family : Arabidopsis thaliana hypothetical protein F3I5.7 

Query Match 88.1%; Score 37; DB 2; Length 178; 

Best Local Similarity 100.0%; Fred. No. 56; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I M I I I 

Db 162 QPPPLP 167 



RESULT 7 
T22685 

hypothetical protein F55A11.1 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 15-Oct-1999 
C; Access ion: T22 685 
R;Kershaw, J. 

submitted to the EMBL Data Library, May 1996 
A; Reference number: Z19600 
A; Access ion: T22 685 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-186 <WIL> 

A; Cross-references : EMBL: Z72 511; PIDN : CAA96655 .1; GSPDB : GN0 0023 ; CESP : F55A11 , 1 

A; Experimental source: clone F55A11 

C; Genetics : 

A; Gene: CESP:F55All. 1 

A;Map position: 5 

A;Introns: 33/3; 59/2; 146/2 

Query Match 88.1%; Score 37; DB 2; Length 186; 

Best Local Similarity 100.0%; Pred, No. 59; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 127 QPPPLP 132 



RESULT 8 
S55129 

transcription factor RGMl yeast ( Saccharomyces cerevisiae) 
N;Alternate names: protein YM8010.12c; protein YMR182c 
C; Species: Saccharomyces cerevisiae 

C;Date: 08-Jul-1995 #sequence_revision Ol-Sep-1995 #text_change 21-Jul-2000 
C;Accession: S55129; S17249 
R;Churcher, CM. 

submitted to the EMBL Data Library, June 1995 
A;Reference number: S55118 
A;Accession: S55129 
A; Molecule type: DNA 
A; Residues: 1-211 <CHU> 

A;Cross-references : EMBL:Z49808; NID:g854440; PID:g854452; MIPS:YMR182c 



A; Experimental source: strain AB972 
R;Estruch, F. 

Nucleic Acids Res. 19, 4873-4877, 1991 

A;Title: The yeast putative transcriptional repressor RGMl is a proline-rich 
zinc finger protein. 

A;Reference number: S17249; MUID : 92 02 0118 ; PMID:1923755 
A;Accession: S17249 
A;Molecule type: DNA 

A;Residues: 1-113 , , 115-211 <EST> 

A;Cross-references: EMBL:X59861; NID:g4317; PIDN : CAA42521 . 1 ; PID:g4318 

C; Genetics : 

A; Gene: SGD:RGM1 

A;Cross-references : SGD : S0004794 ; MIPS:YMR182c 
A;Map position: 13R 

C;Keywords: DNA binding; transcription factor; zinc finger 

Query Match 88.1%; Score 37; DB 2; Length 211; 

Best Local Similarity 100.0%; Pred. No. 67; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 154 QPPPLP 159 



RESULT 9 
A49266 

fas ligand - rat 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 13-Jan-1995 #sequence_revision 13-Jan-1995 #text_change 05-Nov-1999 
C;Accession: A49266 

R;Suda, T.; Takahashi, T.; Golstein, P.; Nagata, S. 
Cell 75, 1169-1178, 1993 

A;Title: Molecular cloning and expression of the Fas ligand, a novel member of 
the tumor necrosis factor family. 

A;Reference number: A49266; MUID : 94084792 ; PMID:7505205 
A;Accession: A49266 
A; Status: preliminary 
A;Molecule type: mRNA 
A; Residues: 1-278 <SUD> 

A;Cross-references : GB:U03470; NID:g440178; PIDN:AAC52129 . 1; PID:g440179 
C; Keywords: glycoprotein; transmembrane protein 

Query Match 88.1%; Score 37; DB 2; Length 278; 

Best Local Similarity 100.0%; Pred. No. 88; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 60 QPPPLP 65 



RESULT 10 
T00575 

probable WRKY-type DNA binding protein [imported] - Arabidopsis thaliana 
N; Alternate names: probable DNA~binding protein T27E13.1 
C; Species: Arabidopsis thaliana (mouse-ear cress) 



C;Date: Ol-Feb-1999 #sequence_revision Ol-Feb-1999 #text_change 23-Mar-2001 
C;Accession: T00575; B84706 

R;Rounsley, S.D.; Lin, X.; Ketchum, K.A. ; Crosby, M.L.; Brandon, R.C; Sykes, 
S.M.; Kaul, S.; Mason, T.M.; Kerlavage, A.R.; Adams, M.D.; Somerville, C.R.; 
Venter, J.C. 

submitted to the EMBL Data Library, May 1998 

A; Description: Arabidopsis thaliana chromosome II BAG T27E13 genomic sequence. 
A;Reference number: Z14178 
A; Accession: TO 057 5 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-393 <ROU> 

A;Cross-references : EMBL : ACQ 04 165 ; NID : g3150396; PID:g3150397 
A; Experimental source: cultivar Columbia 

R;Lin, X.; Kaul, S . ; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, CD.; 
Fujii, C.Y.; Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell, 
C.R.; Ketchum, K.A. ; Lee, J. J.; Ronning, CM.; Koo, H.; Moffat, K.S.; Cronin, 
L.A.; Shen, M. ; VanAken, S.E.; Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams, 
M.D.; Carrera, A. J.; Creasy, T.H.; Goodman, H.M.; Somerville, CR.; Copenhaver, 
G.P.; Preuss, D. ; Nierman, W.C; White, O.; Eisen, J.A. ; Salzberg, S.L.; Fraser, 
CM.; Venter, J.C 
Nature 402, 761-768, 1999 

A; Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis 
thaliana . 

A; Reference number: A84420; MUID : 2 0083487 ; PMID : 10617 197 
A; Accession: B847 0 6 
A; Status: preliminary 
A;Molecule type: DNA 
A; Residues: 1-393 <STO> 

A;Cross-references: GB:AE002093; NID : g2347191 ; PIDN : AAC16930 . 1 ; GSPDB : GN00139 
C; Genetics : 

A;Gene: At2g30250; T27E13.1 
A;Map position: 2 

A;Introns: 79/3; 117/3; 294/2; 348/2 

Query Match 88.1%; Score 37; DB 2; Length 393; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 50 QPPPLP 55 



RESULT 11 
T21349 

hypothetical protein F25H2.8 - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 04-Mar-2000 
C; Accession: T2134 9 
R;Wilkinson, J. 

submitted to the EMBL Data Library, September 1996 
A;Reference number: Z19411 
A; Accession: T2134 9 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-471 <WIL> 



A;Cross-references : EMBL:Z79754; PIDN : CAB02 096 , 1 ; GSPDB : GN00019 ; CESP:F25H2.8 

A; Experimental source: clone F25H2 

C; Genetics: 

A; Gene: CESP:F25H2.8 

A;Map position: 1 

A;Introns: 26/3; 138/3; 302/3; 353/3; 379/2 

C; Superf amily : Caenorhabditis elegans hypothetical protein F25H2 . 8 

Query Match 8 8.1%; Score 37; DB 2; Length 471; 

Best Local Similarity 71.4%; Pred. No. 1.5e+02; 

Matches 5; Conservative 2; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MQPPPLP 7 

: I I I I : I 
Db 451 LQPPPIP 457 



RESULT 12 
A25896 

beta-adrenergic receptor - turkey 

C; Species: Meleagris gallopavo (common turkey) 

C;Date: 16-Aug-1988 #sequence_revision 16-Aug-1988 #text_change 13-Aug-1999 
C;Accession: A25896 

R;Yarden;. Y. ; Rodriguez, H. ; Wong, S.K.F.; Brandt, D.R.; May, D.C.; Burnier, J.; 
Harkins, R.N.; Chen, E.Y.; Ramachandran, J.; Ullrich, A,; Ross, E.M. 
Proc. Natl. Acad. Sci. U.S.A. 83, 6795-6799, 1986 

A; Title: The avian beta-adrenergic receptor: primary structure and membrane 
topology . 

A;Reference number: 7^5896; MUID : 8 6313664 ; PMID:3018746 
A; Accession : A258 9 6 
A; Molecule type: mRNA 
A; Residues: 1-4 83 <YAR> 

A;Cross-references : GB:M14379; NID:g213891; PIDN :AAA4 9627 . 1 ; PID:g213892 
A; Experimental source: erythrocyte 
C; Superf amily : vertebrate rhodopsin 

C;Keywords: G protein-coupled receptor; transmembrane protein 

Query Match 8 8.1%; Score 37; DB 2; Length 4 83; 

Best Local Similarity 100.0%; Pred. No. 1.5e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 2 QPPPLP 7 

I I I I I I 

Db 256 QPPPLP 261 



RESULT 13 
E96633 

probable Serine/Threonine protein kinase F8A5.31 [imported] - Arabidopsis 
thaliana 

C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 31-Mar-2001 
C; Access ion : E96633 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O. ; 
Alonso, J.; Altaf, H.; Araujo, R. ; Bowman, C.L.; Brooks, S.Y.; Buehler, E. ; 
Chan, A.; Chao, Q.; Chen, H.; Cheuk, R.F.; Chin, C.W. ; Chung, M.K.; Conn, L.; 
Conway, A.B.; Conway, A.R.; Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; 



Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C.Y.; Gill, J.E.; Goldsmith, A.D.; 
Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C; Khan, S.; Khaykin, E.; 
Kim, C.J.; Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B. ; Langin- 
Hooper, S.; Lee, A.; Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y. ; Lin, X.; Liu, 
S.X.; Liu, Z.A.; Luros, J.S.; Maiti, R. ; Marziali, A.; Militscher, J.; Miranda, 
M. ; Nguyen, M. ; Nierman, W.C.; Osborne, B.I.; Pai, G. ; Peterson, J.; Pham, P.K.; 
Rizzo, M. ; Rooney, T.; Rowley, D.; Sakano, H. 

A; Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; 
Tallon, L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, 
S.; Vaysberg, M. ; Vysotskaia, V.S.; Walker, M. ; Wu, D.; Yu, G. ; Fraser, CM.; 
Venter, J.C.; Davis, R.W. 

A; Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis . 

A; Reference number: A86141; MUID : 2 1016719 ; PMID : 111307 12 

A;Accession: E96633 

A; Status: preliminary 

A; Molecule type: DNA 

A; Residues: 1-588 <STO> 

A;Cross-references : GB:AE005173; NID: g2462749 ; PIDN : AAB71968 . 1 ; GSPDB : GN0014 1 

C; Genetics : 

A; Gene: F8A5.31 

A;Map position: 1 

Query Match 88.1%; Score 37; DB 2; Length 588; 

Best Local Similarity 100.0%; Pred. No. 1.9e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 550 QPPPLP 555 



RESULT 14 
S06733 

hydroxyproline-rich glycoprotein precursor - common tobacco 
C; Species: Nicotiana tabacum (common tobacco) 

C;Date: 28-Feb-1990 #sequence_revision 28-Feb-1990 #text_change 21-Jul-2000 
C;Accession: S06733 
R;Keller, B.; Lamb, C.J. 
Genes Dev. 3, 1639-1646, 1989 

A; Title: Specific expression of a novel cell wall hydroxyproline-rich 
glycoprotein gene in lateral root initiation. 
A;Reference number: S06733; MUID : 9012 82 63 ; PMID:2612909 
A;Accession: S06733 
A; Molecule type: DNA 
A; Residues: 1-620 <KEL> 

A; Cross-references : EMBL:X13885; NID:gl9866; PIDN : CAA32090 . 1; PID:gl9867 
C; Superf amily : hydroxyproline-rich glycoprotein 
C; Keywords : glycoprotein 

Query Match 88.1%; Score 37; DB 2; Length 620; 

Best Local Similarity 100.0%; Pred. No. 2e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



2 QPPPLP 7 
I I I I I I 



Db 



425 QPPPLP 430 



RESULT 15 
JN0809 

drebrin E (clone gDbhl3) - human 
C; Species: Homo sapiens (man) 

C;Date: 14-Jul-1994 #sequence_revision 14-Jul-1994 #text_change 21-Jul-2000 
C; Accession: JN0 8 09 

R;Toda, M. ; Shirao, T.; Minoshima, S.; Shimizu, N.; Toya, S.; Uyemura, K. 
Biochem. Biophys . Res. Commun. 196, 468-472, 1993 

A; Title: Molecular cloning of cDNA encoding human drebrin E and chromosomal 
mapping of its gene. 

A; Reference number: JN0809; KUID: 94030036; PMID: 8216329 
A; Access ion: JN08 09 
A; Molecule type: mRNA 
A; Residues: 1-649 <TOD> 

A; Cross-references: DDBJ:D17530; NID:g498650; PIDN : B7\A044 80 . 1 ; PID:g498651 
A; Experimental source: brain 

C; Comment: This protein is an actin-binding protein. 

C; Genetics : 

A;Map position: 5 

C; Keywords: actin binding 

Query Match 88.1%; Score 37; DB 2; Length 649; 

Best Local Similarity 100,0%; Fred. No. 2.1e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 363 QPPPLP 368 



Search completed: August 24, 2004, 15:52:38 
Job time : 10.791 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search^ using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



August 24, 2004, 15:51:19 ; Search time 25.3358 Seconds 

(without alignments) 
86.825 Million cell updates/sec 

US-09-641-801-1 
42 

1 MQPPPLP 7 
BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 1295152 seqs, 314255058 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length; 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1295152 



Database 



Published_Applications_AA: 

1 : / cgn2__6/ptodata/l/pubpa 

2 : / cgn2_6/ptodata/l/pubpa 

3 : / cgn2_6/ptodata/ 1/pubpa 

4 : / cgn2_6/ptodat a/ 1/pubpa 

5 : /cgn2_6/ptodata/l/pubpa 

6 : /cgn2_6/ptodata/l/pubpa 

7 : /cgn2_6/ptodat a/ 1/pubpa 

8: /cgn2_6/ptodata/l/pubpa 

9 : /cgn2_6/ptodata/l/pubpa 

10; /cgn2_6/ptodata/l/pubp 

11: /cgn2_6/ptodata/l/pubp 

12 : /cgn2_6/ptodata/ 1/pubp 

13: /cgn2_6/ptodata/ 1/pubp 

14: /cgn2_6/ptodata/ 1/pubp 

15: /cgn2_6/ptodata/ 1/pubp 

16; /cgn2_6/ptodata/ 1/pubp 

17: /cgn2_6/ptodata/ 1/pubp 

18: /cgn2_6/ptodata/ 1/pubp 



a/US07_PUBCOMB.pep: * 
a/PCT_NEW_PUB.pep: * 
a/US06_NEW_PUB.pep: ^ 
a /US 0 6_PUBC0MB . pep : * 
a/US07_NEW_PUB . pep : * 
a/ PCTUS_PUBCOMB . pep : + 
a/USO 8_NEW_PUB . pep : * 
a/US08_PUBCOMB.pep:* 
a/US09A_PUBCOMB.pep: * 
aa/US09B_PUBCOMB.pep: * 
aa/US09C_PUBCOMB.pep: * 
aa/US09_NEW_PUB.pep: ^ 
aa/US10A_PUBCOMB.pep; * 
aa/US10B_PUBCOMB.pep: * 
aa/US10C_PUBCOMB.pep: * 
aa /US 1 0_NEW_PUB . pep : ^ 
aa/US60_NEW_PUB.pep: * 
aa/US 6 0_PUBCOMB . pep : ^ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 
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264209, 
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37 
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-10- 
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-270730 
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270730, 
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37 


88. 
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-963 
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Sequence 


111116, 


35 


37 


88. 


1 
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16 


us 


-10- 


437 


-963 


-131050 


Sequence 


131050, 


36 


37 


88. 


1 


162 


16 


us 


-10- 


437 


-963 


-203781 


Sequence 


203781, 


37 


37 


88. 


1 


163 


16 


us 


-10- 


437 


-963 


-155754 


Sequence 


155754, 


38 


37 


88 . 
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16 


us 


-10- 


437 


-963 


-179013 


Sequence 


179013, 


39 


37 


88. 


1 
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16 
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-10- 


437 


-963 


-130977 


Sequence 


130977, 


40 


37 


88. 


1 
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16 
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-10- 


437 


-963 


-203782 


Sequence 


203782, 


41 


37 


88. 


1 
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16 


us 


-10- 


437 


-963- 


-116550 


Sequence 


116550, 


42 


37 


88. 


1 
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16 


us 


-10- 


437 


-963- 


-185697 


Sequence 


185697, 


43 


37 


88. 


1 
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12 


us- 


-10- 


424 


-599- 


-225072 


Sequence 


225072, 


44 


37 


88. 


1 
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16 


us- 


-10- 


437 


-963- 


-175982 


Sequence 


175982, 


45 


37 


88. 


1 


194 


16 


us- 


-10- 


437 


-963- 


-146891 


Sequence 


146891, 



ALIGNMENTS 



RESULT 1 
US-10-281-652-1 

; Sequence 1, Application US/10281652 
; Publication No. US20030091606A1 
; GENER7VL INFORMATION: 
; APPLICANT: STANTON, G. John 



; APPLICANT: HUGHES, Thomas K. 
; APPLICANT: BOLDOGH, Istvan 

; TITLE OF INVENTION: USE OF COLOSTRININ, CONSTITUENT PEPTIDES THEREOF, AND 
TITLE OF INVENTION: ANALOGS THEREOF AS OXIDATIVE STRESS REGULATORS 

; FILE REFERENCE: 265.00220101 

; CURRENT APPLICATION NUMBER: US/10/28 1, 652 

; CURRENT FILING DATE: 2 002-10-28 

; PRIOR APPLICATION NUMBER: US/ 09/ 64 1 , 8 03 

; PRIOR FILING DATE: 2000-08-17 

; PRIOR APPLICATION NUMBER: 60/149,310 

; PRIOR FILING DATE: 1999-08-17 

; NUMBER OF SEQ ID NOS : 34 

SOFTWARE: PatentlnVer. 2.1 

; SEQ ID NO 1 
LENGTH: 7 

; TYPE: PRT 

; ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence: synthetic 
OTHER INFORMATION: peptide 
US-10-281-652-1 

Query Match 100.0%; Score 42; DB 14; Length 7; 

Best Local Similarity 100.0%; Pred. No. 1.2e+06; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I I I 
Db 1 MQPPPLP 7 



RESULT 2 

US-1 0-424-599- 14 74 58 

; Sequence 147458, Application US/10424599 

; Publication No. US2 0040031072A1 

; GENERAL INFORMATION: 

; APPLICANT: La Rosa Thomas J 

; APPLICTM^T: Kovalic David K 

; APPLICANT: Zhou Yihua 

; APPLICANT: Cao Yongwei 

TITLE OF INVENTION: Soy Nucleic Acid Molecules and Other Molecules Associated 
With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
; FILE REFERENCE: 38-2 1 ( 53223 ) B 
; CURRENT APPLICATION NUMBER: US/ 1 0/42 4 , 5 99 
; CURRENT FILING DATE: 2003-04-28 
; NUMBER OF SEQ ID NOS: 285684 
; SEQ ID NO 147458 
; LENGTH: 76 
; TYPE: PRT 
; ORGANISM: Glycine max 
; FEATURE : 

; OTHER INFORMATION: Clone ID: PAT_MRT3847_104 173C . 1 . pep 
US-10-424-599-147458 



Query Match 92.9%; Score 39; DB 12; Length 76; 

Best Local Similarity 85.7%; Pred. No. 2.4e+02; 



Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 19 LQPPPLP 25 



RESULT 3 

US-09-864-761-37187 

; Sequence 37187, Application US/09864761 

; Patent No. US20020048763A1 

; GENERAL INFORMATION: 

; APPLICANT: Penn, Sharron G. 

; APPLICANT: Rank, David R. 

; APPLICANT: Hanzel, David K. 

; APPLICANT: Chen, Wensheng 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES 
USEFUL FOR 

; TITLE OF INVENTION: GENE EXPRESSION ANALYSIS BY MICROARRAY 
; FILE REFERENCE: Aeomica-X-1 

; CURRENT APPLICATION NUMBER: US/ 0 9/ 8 64 , 7 61 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/180,312 

; PRIOR FILING DATE: 2000-02-04 

; PRIOR APPLICATION NUMBER: US 60/207,456 

; PRIOR FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: US 09/632,366 

; PRIOR FILING DATE: 2000-08-03 

; PRIOR APPLICATION NUMBER: GB 24263.6 

; PRIOR FILING DATE: 2000-10-04 

PRIOR APPLICATION NUMBER: US 60/236,359 

PRIOR FILING DATE: 2000-09-27 
; PRIOR APPLICATION NUMBER: PCT/USOl/00666 

PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00667 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00664 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00669 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/ 00665 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00668 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00663 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/00662 

PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/USOl/ 00661 
; PRIOR FILING DATE: 2001-01-30 

PRIOR APPLICATION NUMBER: PCT/USOl/00670 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: US 60/234,687 

PRIOR FILING DATE: 2000-09-21 

PRIOR APPLICATION NUMBER: US 09/608,408 

PRIOR FILING DATE: 2000-06-30 

PRIOR APPLICATION NUMBER: US 09/774,203 



PRIOR FILING DATE: 2001-01-29 
NUMBER OF SEQ ID NOS : 49117 

SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 
SEQ ID NO 37187 
LENGTH: 132 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 



; OTHER 


INFORMATION: 


MAP TO AL096814.il 




; OTHER 


INFORMATION: 


EXPRESSED 


IN 


HEART, SIGNAL =3.3 




OTHER 


INFORMATION: 


EXPRESSED 


IN 


ADULT LIVER, SIGNAL = 


1.8 


OTHER 


INFORMATION: 


EXPRESSED 


IN 


BRAIN, SIGNAL =2.7 




OTHER 


INFORMATION: 


EXPRESSED 


IN 


BONE MARROW, SIGNAL = 


2.7 


OTHER 


INFORMATION: 


EXPRESSED 


IN 


PLACENTA, SIGNAL = 2. 


7 


OTHER 


INFORMATION: 


EXPRESSED 


IN 


HELA, SIGNAL =3.3 




OTHER 


INFORMATION: 


EXPRESSED 


IN 


HBLIOO, SIGNAL =2.6 




OTHER 


INFORMATION: 


EXPRESSED 


IN 


BT474, SIGNAL =2.7 




OTHER 


INFORMATION: 


EXPRESSED 


IN 


FETAL LIVER, SIGNAL = 


3 


; OTHER 


INFORMATION: 


EXPRESSED 


IN 


LUNG, SIGNAL =3.3 




; OTHER 


INFORMATION: 


EST_HUMAN 


HIT: AW833678.1, EVALUE 


4. OOe 


US-09-864- 


-761-37187 











Query Match 92.9%; Score 39; DB 9; Length 132; 

Best Local Similarity 85.7%; Pred. No. 3.9e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 116 LQPPPLP 122 



RESULT 4 

US-1 0-4 37-9 63- 14 394 8 

Sequence 143948, Application US/10437963 
Publication No. US2 004 0123343A1 
GENERAL INFORMATION: 
APPLICANT: La Rosa, Thomas J. 
APPLIC7\NT: Kovalic, David K. 
APPLICANT: Zhou, Yihua 
APPLICANT: Cao, Yongwei 
APPLICANT: Wu, Wei 
APPLICANT: Boukharov, Andrey A. 
APPLICTMSTT: Barbazuk, Brad 
APPLICANT: Li, Ping 

TITLE OF INVENTION: Rice Nucleic Acid Molecules and Other Molecules 
Associated With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
FILE REFERENCE: 38-2 1 ( 5322 1 ) B 
CURRENT APPLICATION NUMBER: US/10/437 , 963 
CURRENT FILING DATE: 2003-05-14 
NUMBER OF SEQ ID NOS: 204966 
SEQ ID NO 143948 
LENGTH: 332 
TYPE: PRT 

ORGANISM: Oryza sativa 
FEATURE : 

OTHER INFORMATION: Clone ID: PAT_MRT453 0_4 4 8 07C . 1 . pep 



US-10-4 37-9 63-14 3 948 



Query Match 92.9%; Score 39; DB 16; Length 332; 

Best Local Similarity 85.7%; Pred. No. 8.7e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 85 LQPPPLP 91 



RESULT 5 

US-10-437-963-175374 

Sequence 175374, Application US/10437963 
Publication No, US20040123343A1 
GENERAL INFORMATION: 
APPLICANT: La Rosa, Thomas J. 
APPLIC7\NT: Kovalic, David K. 
APPLICANT: Zhou, Yihua 
APPLICANT: Cao, Yongwei 
APPLICANT: Wu, Wei 
APPLICANT: Boukharov, Andrey A. 
APPLICANT: Barbazuk, Brad 
APPLICANT: Li, Ping 

TITLE OF INVENTION: Rice Nucleic Acid Molecules and Other Molecules 
Associated With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
FILE REFERENCE: 38-2 1 ( 5322 1 ) B 
CURRENT APPLICATION NUMBER: US/ 1 0/ 437 , 9 63 
CURRENT FILING DATE: 2003-05-14 
NUMBER OF SEQ ID NOS : 204966 
SEQ ID NO 175374 
LENGTH: 4 07 
TYPE: PRT 

Oryza sativa 



ORGANISM: 
FEATURE : 
NAME/ KEY: 
LOCATION: 



unsure 
(1) . . (407) 

OTHER INFORMATION: unsure at all Xaa locations 
FEATURE: 

OTHER INFORMATION: Clone ID: PAT_MRT4 53 0_73225C. 1 . 
US-10-437-963-175374 



pep 



Query Match 92. 9%; 

Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 39; DB 16; Length 407; 
Pred. No. le+03; 
1 ; Mismatches 0 ; Indels 



0 ; Gaps 



0; 



Qy 



Db 



1 MQPPPLP 7 

: I M I I I 
105 LQPPPLP 111 



RESULT 6 

US-10-369-493-9267 

; Sequence 9267, Application US/10369493 
; Publication No. US2 0030233675A1 
; GENERAL INFORMATION: 



APPLIC7\NT: Cao, Yongwei 
APPLICANT: Hinkle, Gregory J. 
APPLICANT: Slater, Steven C. 
APPLICANT: Goldman, Barry S. 
APPLICANT: Chen, Xianfeng 

TITLE OF INVENTION: EXPRESSION OF MICROBIAL PROTEINS IN PLANTS FOR PRODUCTION 
OF 

TITLE OF INVENTION: PLANTS WITH IMPROVED PROPERTIES 
FILE REFERENCE: 38- 1 0 ( 52 052 ) B 
CURRENT APPLICATION NUMBER: US/ 1 0/3 69 , 4 93 
CURRENT FILING DATE: 2003-02-28 
PRIOR APPLICATION NUMBER: US 60/360,039 
PRIOR FILING DATE: 2002-02-21 
NUMBER OF SEQ ID NOS ; 47374 
SEQ ID NO 9267 
LENGTH: 572 
TYPE: PRT 

ORGANISM: Xylella fastidiosa 
US-10-369-493-9267 



Query Match 92 . 9%; 

Best Local Similarity 85.7%; 
Matches 6; Conservative 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 356 LQPPPLP 362 



Score 39; DB 15; Length 572; 
Pred. No. 1.4e+03; 
1; Mismatches 0; Indels 



0; Gaps 



0; 



RESULT 7 

US-10-369-493-9495 

Sequence 9495, Application US/10369493 
Publication No, US20030233675A1 
GENERAL INFORMATION: 
APPLICANT : Cao, Yongwei 
APPLICANT: Hinkle, Gregory J. 
APPLICANT: Slater, Steven C. 
APPLICANT: Goldman, Barry S. 
APPLICANT: Chen, Xianfeng 

TITLE OF INVENTION: EXPRESSION OF MICROBIAL PROTEINS IN PLANTS FOR PRODUCTION 
OF 

TITLE OF INVENTION: PLANTS WITH IMPROVED PROPERTIES 
FILE REFERENCE: 3 8-1 0 ( 52 052 ) B 
CURRENT APPLICATION NUMBER: US/ 1 0/ 3 69 , 4 93 
CURRENT FILING DATE: 2003-02-28 
PRIOR APPLICATION NUMBER: US 60/360,039 
PRIOR FILING DATE: 2002-02-21 
NUMBER OF SEQ ID NOS: 47374 
SEQ ID NO 9495 
LENGTH: 572 
TYPE: PRT 

ORGANISM: Xylella fastidiosa 
US-10-369-493-9495 



Query Match 92.9%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 39; DB 15; Length 572; 
Pred. No. 1.4e+03; 
1; Mismatches 0; Indels 0; Gaps 



0; 



Qy 

Db 



1 MQPPPLP 7 
: I I I I I I 
356 LQPPPLP 362 



RESULT 8 

US-10-369-493-17539 

Sequence 17539, Application US/10369493 
Publication No. US20030233675A1 
GENERAL INFORMATION: 



APPLICANT 
APPLIC7\NT 
APPLICANT 
APPLICANT 
APPLICANT 



Cao, Yongwei 
Hinkle, Gregory J. 
Slater, Steven C. 
Goldman, Barry S. 
Chen, Xianfeng 



TITLE OF INVENTION: EXPRESSION OF MICROBIAL PROTEINS IN PLANTS FOR PRODUCTION 
OF 

TITLE OF INVENTION: PLANTS WITH IMPROVED PROPERTIES 
FILE REFERENCE: 38~10 ( 52 052 ) B 
CURRENT APPLICATION NUMBER: US/10/369, 493 
CURRENT FILING DATE: 2003-02-28 
PRIOR APPLICATION NUMBER: US 60/360,039 
PRIOR FILING DATE: 2002-02-21 
NUMBER OF SEQ ID NOS : 47374 
SEQ ID NO 17539 
LENGTH: 612 
TYPE: PRT 

ORGANISM: Xylella fastidiosa 
US-10-369-493-17539 



Query Match 92.9%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 356 LQPPPLP 362 



Score 39; DB 15; Length 612; 
Pred. No. 1.5e+03; 
1; Mismatches 0; Indels 



0; Gaps 



0; 



RESULT 9 

US-10-424-599-148127 

; Sequence 148127, Application US/10424599 

; Publication No. US2 0040031072A1 

; GENERAL INFORMATION: 

; APPLICANT: La Rosa Thomas J 

; APPLICANT: Kovalic David K 

; APPLICANT: Zhou Yihua 

; APPLICANT: Cao Yongwei 

TITLE OF INVENTION: Soy Nucleic Acid Molecules and Other Molecules Associated 
With 

; TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 

; FILE REFERENCE: 3 8-2 1 ( 5322 3 ) B 

; CURRENT APPLICATION NUMBER: US/ 10/424 , 5 99 

; CURRENT FILING DATE: 2003-04-28 

; NUMBER OF SEQ ID NOS: 285684 

; SEQ ID NO 148127 



LENGTH: 18 9 
; TYPE: PRT 
; ORGANISM: Glycine max 

FEATURE : 

NAME/KEY: unsure 
; LOCATION: (1) . . (189) 

; OTHER INFORMATION: unsure at all Xaa locations 
FEATURE : 

OTHER INFORMATION: Clone ID: PAT_MRT3847_1047 80C . 1 . pep 
US-10-424-599-148127 

Query Match 90,5%; Score 38; DB 12; Length 189; 

Best Local Similarity 85.7%; Pred. No, 7.2e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I : I I M I 

Db 105 MKPPPLP 111 



RESULT 10 

US-10-4 0 8-7 65A-2896 

Sequence 2896, Application US/10408765A 
Publication No. US20040101874A1 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Ghosh, Soumitra S, 
Fahy, Eoin D. 
Zhang, Bing 
Gibson, Bradford W, 
Taylor, Steven W. 
Glenn, Gary M. 
Warnock, Dale E, 
TITLE OF INVENTION: TARGETS FOR THERAPEUTIC INTERVENTION 
TITLE OF INVENTION: IDENTIFIED IN THE MITOCHONDRIAL PROTEOME 
FILE REFERENCE: 660088,465 

CURRENT APPLICATION NUMBER: US/10/408, 765A 
CURRENT FILING DATE: 2003-04-04 
NUMBER OF SEQ ID NOS : 3077 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 2896 
LENGTH: 25 6 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-4 0 8-7 65A-2896 

Query Match 90.5%; Score 38; DB 16; Length 256; 

Best Local Similarity 85.7%; Pred. No. 9.3e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 191 VQPPPLP 197 



RESULT 11 

US- 10-42 4-5 99- 14 742 6 

; Sequence 147426, Application US/10424599 



; Publication No. US2 004 0031072A1 

; GENERAL INFORMATION: 

; APPLICANT: La Rosa Thomas J 

; APPLICANT: Kovalic David K 

; APPLICANT: Zhou Yihua 

; APPLICANT: Cao Yongwei 

; TITLE OF INVENTION: Soy Nucleic Acid Molecules and Other Molecules Associated 
With 

; TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 

; FILE REFERENCE: 38-2 1 ( 53223 ) B 

; CURRENT APPLICATION NUMBER: US/10/424 , 599 

; CURRENT FILING DATE: 2003-04-28 

; NUMBER OF SEQ ID NOS : 285684 

; SEQ ID NO 147426 

LENGTH: 414 

TYPE: PRT 
; ORGANISM: Glycine max 
; FEATURE: 

; OTHER INFORMATION: Clone ID: PAT_MRT3847_104144C . 1 . pep 
US- 10-42 4-5 99- 14 7 42 6 

Query Match 90.5%; Score 38; DB 12; Length 414; 

Best Local Similarity 85.7%; Pred'. No. 1.4e+03; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I : I I I I I 

Db 105 MKPPPLP 111 



RESULT 12 

US- 10-437-9 63- 14321 9 

Sequence 143219, Application US/10437963 
Publication No. US20040123343A1 
GENERAL INFORMATION: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
APPLICANT: 
TITLE OF INVENTION 
Associated With 

TITLE OF INVENTION: Plants and Uses Thereof for Plant Improvement 
FILE REFERENCE: 3 8-2 1 { 5322 1 ) B 
CURRENT APPLICATION NUMBER: US/ 10/ 4 37 , 963 
CURRENT FILING DATE: 2003-05-14 
NUMBER OF SEQ ID NOS: 204966 
SEQ ID NO 143219 
LENGTH: 812 
TYPE: PRT 

ORGANISM: Oryza sativa 
FEATURE: 

OTHER INFORMATION: Clone ID: PAT_MRT4 530_4 4 14 9C . 1 . pep 
US-10-437-963-143219 



La Rosa, Thomas J. 
Kovalic, David K. 
Zhou, Yihua 
Cao, Yongwei 
Wu, Wei 

Boukharov, Andrey A. 
Barbazuk, Brad 
Li, Ping 

Rice Nucleic Acid Molecules and Other Molecules 



Query Match 90.5%; Score 38; DB 16; Length 812; 

Best Local Similarity 85.7%; Pred. No. 2.5e+03; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 

Db 26 VQPPPLP 32 



RESULT 13 
US-09-899-495-68 

; Sequence 68, Application US/09899495 
; Publication No. US20030088060A1 
; GENERAL INFORMATION: 

; APPLICANT: Benjamin, Christopher W. 

; APPLICANT: Roberts, Steven L. 

; APPLICANT: Karnovsky, Alia M. 

; APPLICANT: Ruble, Cara L. 

; TITLE OF INVENTION: Human Ion Channels 

; FILE REFERENCE: 00188US1 

; CURRENT APPLICATION NUMBER: US/ 09/ 8 99 , 4 95 

; CURRENT FILING DATE: 2001-07-05 

; PRIOR APPLICATION NUMBER: 60/215,815 

; PRIOR FILING DATE: 2000-07-05 

; PRIOR APPLICATION NUMBER: 60/216,481 

; PRIOR FILING DATE: 2000-07-06 

; PRIOR APPLICATION NUMBER: 60/216,479 

; PRIOR FILING DATE: 2000-07-06 

; PRIOR APPLICATION NUMBER: 60/216,482 

PRIOR FILING DATE: 2000-07-06 

PRIOR APPLICATION NUMBER: 60/217,096 

PRIOR FILING DATE: 2000-07-10 
; NUMBER OF SEQ ID NOS : 125 

SOFTWARE: Patentin version 3.0 
; SEQ ID NO 68 
LENGTH: 38 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09~899-495-68 

Query Match 88.1%; Score 37; DB 10; Length 38; 

Best Local Similarity 100.0%; Pred. No. 2.4e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 25 QPPPLP 30 



RESULT 14 
US-09-802-668-109 

; Sequence 109, Application US/09802668 

; Publication No. US2 00301907 14A1 

; GENER7VL INFORMATION: 

; APPLICANT: Wood, Linda 

; APPLICANT: Vogeli, Gabriel 



APPLICANT: Karnovsky, Alia 
APPLICANT: Linske-0 ' Connell, Lisa I. 
APPLICANT: Wang, Jun 
APPLICANT: Liu, Derong 

TITLE OF INVENTION: Human Ion Channels 
FILE REFERENCE: 00069US1 

CURRENT APPLICATION NUMBER: US/ 09/ 8 02 , 668 
CURRENT FILING DATE: 2 001-03-09 
PRIOR APPLICATION NUMBER: 60/188,517 
PRIOR FILING DATE: 2000-03-10 
NUMBER OF SEQ ID NOS : 117 
SOFTWARE: Patentin version 3,0 
SEQ ID NO 109 
LENGTH: 3 8 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-802-668-109 

Query Match 88.1%; Score 37; DB 10; Length 38; 

Best Local Similarity 100.0%; Pred. No. 2.4e+02; 
Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 2 QPPPLP 7 

I I I I I I 

Db 25 QPPPLP 30 



RESULT 15 
US-10-243-475-109 

; Sequence 109, Application US/10243475 

; Publication No. US20030194720A1 

; GENERAL INFORMATION: 

; APPLICANT: Roberds, Steven L. 

; APPLICANT: Benjamin, Christopher W. 

; APPLICANT: Karnovsky, Alia M. 

; APPLICANT: Ruble, Cara L. 

; TITLE OF INVENTION: Human Ion Channels 

; FILE REFERENCE: PHRM0039-100 

; CURRENT APPLICATION NUMBER: US/10/243, 475 

; CURRENT FILING DATE: 2003-01-16 

; PRIOR APPLICATION NUMBER: 60/403,254 

; PRIOR FILING DATE: 2002-08-13 

; PRIOR APPLICATION NUMBER: 60/318,733 

; PRIOR FILING DATE: 2001-09-12 

; NUMBER OF SEQ ID NOS: 138 

; SOFTWARE: Patentin version 3.2 

; SEQ ID NO 109 

LENGTH: 3 8 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-243-475-109 

Query Match 88.1%; Score 37; DB 14; Length 38; 

Best Local Similarity 100.0%; Pred. No. 2.4e+02; 
Matches 6; Conservative 0; Mismatches 0; Indels 



Qy 



2 QPPPLP 7 



Db 25 QPPPLP 30 

Search completed: August 24, 2004, 16:41:11 
Job time : 28.3358 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



August 24, 2004, 15:23:00 ; Search time 21.6269 Seconds 

(without alignments) 
102.124 Million cell updates/sec 

US-09-641-801-1 
42 

1 MQPPPLP 7 
BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1017041 



Database 



SPTREMBL 25:* 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 



sp_archea : 
sp_bacteria : * 
sp_f ungi : * 
sp_human : * 
sp_invertebrate : * 
sp_mammal : * 
sp_mhc : * 
sp_organelle : * 
sp_phage : * 

sp_plant : * 

sp_rodent : * 

sp_virus : * 

sp_vertebrate : * 

sp_unclassif led : * 

sp_rvirus : * 

sp_bacteriap : * 

sp_archeap : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result Query 

No. Score Match Length DB ID 



Description 



1 


42 


100 . 


, 0 


2 61 


5 


QyVAXo 


Q9vax5 drosophila 


2 


40 


95 , 


. 2 


600 


2 


Q8GPV8 


Q8gpv8 pseudomonas 


3 


40 


95 , 


. 2 


785 


10 


Q8RXF1 


Q8rxfl arabidopsis 


4 


40 


95 . 


. 2 


1776 


10 


Q9iyiA20 


Q9ma20 arabidopsis 


5 


39 


92 . 


, 9 


612 


16 


Q9PF38 


Q9pr38 xyleila ras 


6 


39 


92 . 


, 9 


612 


16 


Q T TV TO 

Q87AJ2 


Qo/ajz xylella ras 


7 


39 


92 . 


, 9 


1404 


5 


Q9GRR6 


Q9grr6 leishrtiania 


8 


38 


90 . 


, 5 


542 


10 


Qy sgyd 


Q9sgy5 arabidopsis 


9 


38 


90 . 


, 5 


678 


4 


Q8TES7 


Q8tes7 homo sapien 


10 


38 


90 . 


, 5 


812 


10 


Q8LIE4 


Q81ie4 oryza sativ 


11 


38 


90 . 


, 5 


2165 


5 


Q8T1F5 


Q8tlf5 dictyostelx 


12 


38 


90 . 


, 5 


287 9 


5 


Q8IC57 


Q8ic57 Plasmodium 


13 


37 


8 8 . 


. 1 


44 


8 


Q85G37 


QoogJ / cyanidioscn 


14 


37 


88 . 


, 1 


72 


15 


Q87610 


Q87610 chimpanzee 


15 


37 


88 . 


, 1 


73 


15 


Q87603 


Q87603 chimpanzee 


16 


37 


88 . 


, 1 


99 


15 


Q87482 


Q87482 chimpanzee 


17 


37 


88 . 


. 1 


126 


5 


Q9NDS3 


Q9nds3 penaeus jap 


18 


37 


88 . 


. 1 


126 


5 


Q9NDS2 


Q9nds2 penaeus jap 


19 


37 


88 , 


. 1 


128 


10 


Q8GYP2 


Q8gyp2 arabidopsis 


20 


37 


88 . 


, 1 


137 


10 


Q84QQ0 


Q84qq0 oryza sativ 


21 


37 


88 . 


, 1 


164 


5 


Q18043 


Q18043 caenorhabdi 


22 


37 


88 . 


. 1 


178 


10 


048681 


048681 arabidopsis 


23 


37 


88 . 


, 1 


186 


5 


Q20796 


Q20796 caenorhabdi 


24 


37 


88 . 


, 1 


190 


11 


Q811B5 


Q811b5 mus musculu 


25 


37 


88 , 


, 1 


222 


10 


Q8S1H5 


Q8slh5 oryza sativ 


26 


37 


88 . 


, 1 


223 


16 


Q8PI93 


Q8pi93 xanthomonas 


27 


37 


88 . 


, 1 


223 


16 


Q8P6X9 


Q8p6x9 xanthomonas 


28 


37 


88 . 


, 1 


226 


10 


Q7XDT9 


Q7xdt9 oryza sativ 


29 


37 


88 . 


. 1 


276 


16 


Q8DHT8 


Q8dht8 synechocoGc 


30 


37 


88 . 


, 1 


294 


10 


Q9LG53 


Q91g53 oryza sativ 


31 


37 


88 . 


, 1 


302 


4 


Q86U16 


Q86ul6 homo sapien 


32 


37 


88 . 


. 1 


322 


4 


Q9BSM2 


Q9bsm2 homo sapien 


33 


37 


88 . 


, 1 


325 


16 


Q8 9MW5 


Q89mw5 bradyrhizob 


34 


37 


88 . 


, 1 


346 


11 


Q99LN2 


Q991n2 mus musculu 


35 


37 


88 , 


. 1 


353 


11 


Q8BTT8 


Q8btt8 mus musculu 


36 


37 


88 . 


, 1 


357 


4 


Q96RZ4 


Q96rz4 homo sapien 


37 


37 


88 . 


, 1 


358 


4 


Q8N249 


Q8n249 homo sapien 


38 


37 


88 . 


. 1 


401 


4 


Q8IVY2 


Q8ivy2 homo sapien 


39 


37 


88 . 


. 1 


420 


16 


Q7U607 


Q7u607 synechococc 


40 


37 


88 . 


, 1 


421 


10 


Q9FTS7 


Q9fts7 oryza sativ 


41 


37 


88. 


, 1 


423 


10 


Q8H5B0 


Q8h5b0 oryza sativ 


42 


37 


88. 


, 1 


431 


5 


Q8SXY7 


Q8sxy7 drosophila 


43 


37 


88. 


, 1 


440 


5 


Q9V6M1 


Q9v6ml drosophila 


44 


37 


88, 


, 1 


456 


5 


Q9N9U6 


Q9n9u6 leishmania 


45 


37 


88. 


, 1 


471 


4 


Q7Z6B2 


Q7z6b2 homo sapien 



ALIGNMENTS 



RESULT 1 
Q9VAX5 

ID Q9VAX5 PRELIMINARY; 
AC Q9VAX5; 

DT Ol-MAY-2000 (TrEMBLrel. 
DT Ol-OCT-2002 (TrEMBLrel. 
DT Ol-MAR-2003 (TrEMBLrel. 



PRT; 261 AA. 
13, Created) 

22, Last sequence update) 

23, Last annotation update) 



DE CG12872 protein. 

GN CG12872. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=2 019600 6; PubMed=10731132 ; 

RA Adams M.D., Celniker S.E,, Holt R.A., Evans C.A. , Gocayne J.D., 

RA T^anatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 

RA George R.A. , Lewis S.E., Richards S., Ashburner M., Henderson S.N.^ 

RA Sutton G.G., Wortman J.R., Yandell Zhang Q., Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G,^ Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C, Baxter E.G., Helt G. , Nelson C.R., Miklos G.L.G., 

RA 7\bril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch Baldwin D. , 

RA Ballew R.M. , Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M. , Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D,, Dew I., Dietz S.M., 

RA Dodson K. , Doup L.E., Downes M., Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W. , 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., 

RA Glodek A., Gong F. , Gorrell J.H., Gu Z . , Guan P., Harris M., 

RA Harris N.L., Harvey D., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A. , Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F. , Karpen G.H., Ke Z . , Kennison J.A. , Ketchum K.A., 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y., Levitsky A.A. , Li J., Li Z., Liang Y., Lin X., 

RA Liu X., Mattel B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G. , Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy M. , Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A. , Nixon K. , Nusskern D.R,, Pacleb J,M., 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J., Purl V., Reese M.G., 

RA Reinert K., Remington K. , Saunders R.D.C., Scheeler F., Shen H., 

RA Shue B.C., Siden-Kiamos I,, Simpson M., Skupski M.P., Smith T., 

RA Spier E., Spradling A.C., Stapleton M., Strong R., Sun E., 

RA Svirskas R. , Tector C, Turner R. , Venter E. , Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.~F., Zaveri J.S., Zhan M., Zhang G. , Zhao Q., Zheng L. , 

RA Zheng X.H., Zhong F.N., Zhong W. , Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A., Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster."; 

RL Science 287:2185-2195(2000). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Celniker S.E., Adams M.D., Kronmiller B., Wan K.H., Holt R.A., 

RA Evans C.A., Gocayne J.D., Amanatides P.G., Brandon R.C., Rogers Y., 

RA Banzon J., An H., Baldwin D., Banzon J., Beeson K.Y., Busam D.A. , 

RA Carlson J.W., Center A., Champe M. , Davenport L.B., Dietz S.M., 

RA Dodson K. , Dorsett V., Doup L.E., Doyle C. , Dresnek D., Farfan D., 

RA Ferriera S., Frise E., Galle R.F., Garg N.S., George R.A. , 



RA Gonzalez M. , Houck J., Hoskins R.A. Hostin D., Howland T.J., 

RA Ibegwam C, Jalali M. , Kruse D., Li P., Mattel B., Moshrefl A., 

RA Mcintosh T.C., Moy M. , Murphy B., Nelson Nelson K.A., Nunoo J.^ 

RA Pacleb J., Paragas V., Park S., Patel S., Pfeiffer B., 

RA Phouanenavong S.^ Pittman G.S., Purl V., Richards S., Scheeler F. , 

RA Stapleton M. ^ Strong R. , Svirskas R. ^ Tector Tyler D., 

RA Williams S.M., Zaveri J.S,, Smith H.O., Venter J.C., Rubin G.M. ; 

RT "Sequencing of Drosophila melanogaster genome."; 

RL Submitted {MAR-2000) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RA Misra S., Crosby M.A. , Matthews B.B., Bayraktaroglu L., Campbell K. , 

RA Hradecky P., Huang Y. , Kaminker J.S., Prochnik S.E., Smith CD., 

RA Tupy J.L., Bergman C, Berman B., Carlson J.W,, Celniker S.E., 

RA Clamp M. , Drysdale R. , Emmert D., Frise E,, de Grey A., Harris N., 

RA Kronmiller B., Marshall B., Millburn G., Richter J., Russo 3., 

RA Searle S.M.J., Smith E., Shu S., Smutniak F. , Whitfield E., 

RA Ashburner M, , Gelbart W.M. , Rubin G.M. , Mungall C.J., Lewis S.E.; 

RT "TVnnotation of Drosophila melanogaster genome."; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Adams M.D., Celniker S.E., Gibbs R.A. , Rubin G.M., Venter C.J.; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases, 

RN [5] 

RP SEQUENCE FROM N.A. 

RA FlyBase; 

RL Submitted (SEP-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AE003763; AAF56772,2; -, 

DR FlyBase; FBgn0039569; CG12872. 

SQ SEQUENCE 261 AA; 28401 MW; D5753D823160DAA6 CRC64; 

Query Match 100.0%; Score 42; DB 5; Length 261; 

Best Local Similarity 100.0%; Pred. No. 14; 

Matches 7; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

I I I I I I I 

Db 194 MQPPPLP 200 



RESULT 2 
Q8GPV8 

ID Q8GPV8 PRELIMINARY; PRT; 600 AA. 

AC Q8GPV8; 

DT Ol-MAR-2003 (TrEMBLrel, 23, Created) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT Ol-MAR-2003 (TrEMBLrel. 23, Last annotation update) 

DE Conserved hypothetical protein. 

OS Pseudomonas aeruginosa. 

OC Bacteria; Proteobacteria; Gammaproteobacteria; Pseudomonadales ; 

OC Pseudomonadaceae; Pseudomonas. 

OX NCBI_TaxID=287; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=SG17M; 

RX MEDLINE=22313472; PubMed=1242 6355 ; 



RA Larbig K.D., Christmann A., Johann A. ^ Klockgether J.^ Hartsch T., 

RA Merkl R. , Wiehlmann L., Fritz H.J., Tummler B. ; 

RT "Gene Islands Integrated into tRNA(Gly) Genes Confer Genome Diversi 

RT on a Pseudomonas aeruginosa Clone."; 

RL J. Bacteriol. 184:6665-6680(2 002). 

DR EMBL; AF440524; AAN62266.1; 

KW Hypothetical protein. 

SQ SEQUENCE 600 AA; 66371 MW; BF4 3C0A0E10FFF9D CRC64; 



Query Match 95.2%; 
Best Local Similarity 85.7%; 
Matches 5; Conservative 



Score 40; DB 2; 
Pred. No. 65; 
1; Mismatches 



Length 600; 
0; Indels 



0 ; Gap 



Qy 

Db 



1 MQPPPLP 7 

I I I I I : I 
460 MQPPPMP 466 



RESULT 3 
Q8RXF1 

ID Q8RXF1 PRELIMINARY; PRT; 785 AA. 

AC Q8RXF1; 

DT Ol-JUN-2002 (TrEMBLrel. 21, Created) 

DT Ol-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Splicing factor, putative. 

GN AT1G14650. 

OS Arabidopsis thaliana (Mouse-ear cress). 

OC Eukaryota; Viridiplantae ; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta ; eudi cotyledons ; core eudicots; rosids 

OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=37 02 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Nguyen M. , Karlin-Neumann G., Southwick A., Lam B., Miranda M. , 

RA Palm C.J., Bowser L., Jones T., Banh J., Carninci P., Chen H. , 

RA Cheuk R. , Chung M.K., Hayashizaki Y. , Ishida J., Kamiya A., Kawai J 

RA Kim C, Lin J., Liu S.X., Narusaka M. , Pham P.K., Sakano H., 

RA Sakurai T., Satou M., Seki M. , Shinn P., Yamada K. , Shinozaki K., 

RA Ecker J., Theologis A., Davis R.W. ; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AY081293; AAL91182.1; -. 

DR GO; GO:0003723; F:RNA binding; lEA. 

DR GO; GO:0006396; P:RNA processing; lElA. 

DR InterPro; IPR000061; Surp. 

DR InterPro; IPR000626; Ubiquitin. 

DR Pfam; PF01805; Surp; 2. 

DR Pfam; PF00240; ubiquitin; 1. 

DR SMART; SM00 64 8; SWAP; 2. 

DR SMART; SM00213; UBQ; 1. 

DR PROSITE; PS50128; SURP; 2. 

DR PROSITE; PS50053; UBIQUITIN_2; 1. 

SQ SEQUENCE 785 AA; 87536 MW; Al 958C193E63BDC4 CRC64; 



Query Match 95.2%; Score 40; DB 10; Length 785; 

Best Local Similarity 85.7%; Pred. No. 84; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 



Qy 1 MQPPPLP 7 

I I I I I : I 

Db 652 MQPPPMP 658 



RESULT 4 
Q9MA2 0 

ID Q9MA2 0 PRELIMINARY; PRT; 177 6 AA. 

AC Q9MA2 0; 

DT Ol-OCT-2 000 (TrEMBLrel . 15, Created) 

DT Ol-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE T5E21.13. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta; Enibryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicots ; rosids ; 

OC eurosids II ; Brassicales ; Brassicaceae; Arabidopsis . 

OX NCBI_TaxID=37 02; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Chao Q., Brooks S., Buehler E., Johnson-Hopson C, Khan S., Kim C. , 

RA Shinn P., Tambunga G. , Altafi H., Bei Q., Chin C, Chiou J., Choi E. 

RA Conn L., Conway A., Gonzales A., Hansen N., Howng B., Koo T., Lam B. 

RA Lee J., Lenz C, Li J., Liu A., Liu K., Liu S., Mukharsky N., 

RA Nguyen M. , Palm C, Pham P., Sakano H., Schwartz J., Southwick A., 

RA Thaveri A., Toriumi M. , Vaysberg M. , Yu G., Federspiel N.A,, 

RA Theologis A., Ecker J.R.; 

RT "Genomic sequence for Arabidopsis thaliana BAC T5E21 from chromosome 

RT I . " ; 

RL Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AC010657; AAF63169.1; -. 

DR PIR; G86280; G86280. 

DR HSSP; P02248; lUBI . 

DR GO; GO:0003723; F: RNA binding; lEA. 

DR GO; GO:0006396; P : RNA processing; lEA. 

DR InterPro; IPR000061; Surp. 

DR InterPro; IPR000626; Ubiquitin. 

DR Pfam; PF01805; Surp; 4. 

DR Pfam; PF00240; ubiquitin; 1. 

DR SMART; SM00648; SWAP; 4. 

DR SMART; SM00213; UBQ; 1, 

DR PROSITE; PS50128; SURP; 4. 

DR PROSITE; PS50053; UBIQUITIN_2; 1. 

SQ SEQUENCE 1776 AA; 200270 MW; C1C28169934E0A27 CRC64 ; 

Query Match 95.2%; Score 40; DB 10; Length 1776; 

Best Local Similarity 85.7%; Pred. No. 1.8e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 

Qy 1 MQPPPLP 7 

I I I I I : I 

Db 1643 MQPPPMP 1649 



RESULT 5 
Q9PF38 



ID Q9PF38 PRELIMINARY; PRT; 612 AA. 

AC Q9PF38; 

DT Ol-OCT-2000 (TrEMBLrel. 15, Created) 

DT Ol-OCT-2000 (TrEMBLrel. 15, Last sequence update) 

DT 01-OCT~2003 (TrEMBLrel. 25, Last annotation update) 

DE Beta-galactosidase (EC 3.2.1.23) (Lactase). 

GN XF0840. 

OS Xylella fastidiosa. 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Xanthomonadales ; 

OC Xanthomonadaceae; Xylella. 

OX NCBI_TaxID==2371; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=9a5c; 

RX MEDLINE=20365717; PubMed^l 091034 7 ; 

RA Simpson A.J.G., Reinach F.C., Arruda P., Abreu F.A., Acencio M. , 

RA Alvarenga R. , Alves L.M.C., Araya J.E., Baia G.S., Baptista C.S., 

RA Barros M.H., Bonaccorsi E.D,, Bordin S,, Hove J.M., Briones M.R.S., 

RA Bueno M.R.P., Camargo A. A., Camargo L.E.A., Carraro D.M., Carrer H., 

RA Colauto N.B., Colombo C, Costa F.F., Costa M.C.R., Costa-Neto CM., 

RA Coutinho L.L., Cristofani M. , Dias-Neto E., Docena C. , El-Dorry H., 

RA Facincani A. P., Ferreira A.J.S., Ferreira V.C.A., Ferro J.A. , 

RA Fraga J.S., Franca S.C., Franco M.C., Frohme M. , Furlan L.R., 

RA Garnier M. , Goldman G.H., Goldman M.H.S., Gomes S.L., Gruber A., 

RA Ho P.L., Hoheisel J.D., Junqueira M.L., Kemper E.L,, Kitajima J. P., 

RA Krieger J.E., Kuramae E.E., Laigret F., Lambais M.R., Leite L.C.C., 

RA Lemos E.G.M., Lemos M.V.F,, Lopes S.A., Lopes C.R., Machado J. A., 

RA Machado M.A., Madeira A.M.B.N., Madeira H.M.F., Marino C.L., 

RA Marques M.V. , Martins E.A.L., Martins E.M.F., Matsukuma A.Y., 

RA Menck C.F.M., Miracca E.G., Miyaki C.Y., Monteiro-Vitorello C.B., 

RA Moon D.H., Nagai M.A., Nascimento A.L.T.O., Netto L.E.S., 

RA Nhani A. Jr., Nobrega E.G., Nunes L.R., Oliveira M.A. , 

RA de Oliveira M.C., de Oliveira R.C., Palmieri D.A., Paris A., 

RA Peixoto B.R., Pereira G.A.G., Pereira H.A. Jr., Pesquero J.B., 

RA Quaggio R.B., Roberto P.G., Rodrigues V., de Rosa A.J.M., 

RA de Rosa V.E. Jr., de Sa R.G., Santelli R.V., Sawasaki H.E., 

RA da Silva A.C.R., da Silva A.M., da Silva F.R., Silva W.A. Jr., 

RA da Silveira J.F., Silvestri M.L.Z., Siqueira W.J., de Souza A. A., 

RA de Souza A. P., Terenzi M.F., Truffi D., Tsai S.M., Tsuhako M.H., 

RA Vallada H., Van Sluys M. A. , Ver j ovs ki-Almeida S., Vettore A.L., 

RA Zago M.A. , Zatz M. , Meidanis J., Setubal J.C.; 

RT "The genome sequence of the plant pathogen Xylella fastidiosa."; 

RL Nature 406:151-159(2000). 

CC CATALYTIC ACTIVITY: HYDROLYSIS OF TERMINAL, NON-REDUCING BETA-D- 

CC GALACTOSE RESIDUES IN BETA-D-GALACTOSIDES . 

CC -!- SIMILARITY: BELONGS TO FAMILY 35 OF GLYCOSYL HYDROLASES. 

DR EMBL; AE003923; AAF83650.1; 

DR PIR; E82756; E82756. 

DR GO; GO: 0009341; C : beta-galactosidase complex; lEA. 

DR GO; GO: 0004565; F : beta-galactosidase activity; lEA. 

DR GO; GO: 0016798; F: hydrolase activity, acting on glycosyl bonds; lEA. 

DR GO; GO:0005975; P : carbohydrate metabolism; lEA. 

DR InterPro; IPR008979; Gal_bind_like . 

DR InterPro; IPR001944; Glyco_hydro_35 . 

DR Pfam; PF01301; Glyco_hydro_35 ; 1. 

DR PRINTS; PR00742; GLHYDRLASE35 . 

DR PROSITE; PS01182; GLYCOSYL HYDROL F35; 1, 



KW Glycosidase; Hydrolase; Complete proteome. 

SQ SEQUENCE 612 AA; 67893 MW; B894D6A3EB789BCB CRC64; 



Query Match 92.9%; Score 39; DB 16; Length 612; 

Best Local Similarity 85.7%; Pred. No. 98; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 356 LQPPPLP 362 



RESULT 6 
Q87AJ2 

ID Q87AJ2 PRELIMINARY; 

AC Q87AJ2; 

DT Ol-JUN-2003 (TrEMBLrel. 24, 

DT Ol-JUN-2003 (TrEMBLrel. 24, 

DT Ol-OCT-2003 (TrEMBLrel. 25, 

DE Beta-galactosidase , 

GN BGA OR PD1833. 

OS Xylella fastidiosa (strain Temeculal / ATCC 700964). 

OC Bacteria; Proteobacteria ; Gammaproteobacteria ; Xanthomonadales ; 

OC Xanthomonadaceae; Xylella. 

OX NCBI_TaxID-183190; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22421331; PubMed=12533478 ; 

RA Van Sluys M.A. , de Oliveira M.C., Monteiro-Vitorello C.B., 

RA Miyaki C.Y., Furlan L.R., Camargo L.E.A. , da Silva A.C.R., Moon D.H 

RA Takita M.A. , Lemos E.G.M., Machado M.A., Ferro M.I.T., da Silva F.R 

RA Goldman M.H.S., Goldman G.H,, Lemos M.V.F., El-Dorry H., Tsai S.M., 

RA Carrer H., Carraro D.M. , de Oliveira R.C, Nunes L.R., Siqueira W.J 

RA Coutinho L.L., Kimura E.T., Ferro E.S., Harakava R., Kuramae E.E., 

RA Marino C.L., Giglioti E., Abreu I.L., Alves L.M.C., do Amaral A.M., 

RA Baia G.S., Blanco S.R., Brito M.S., Cannavan F.S., Celestino A.V. , 

RA da Cunha A.F., Fenille R.C., Ferro J. A., Formighieri E.F., Kishi L. 

RA Leoni S.G., Oliveira A.R., Rosa V.E. Jr., Sassaki F.T., Sena J.A.D. 

RA de Souza A. A., Truffi D., Tsukumo F. , Yanai G.M., Zaros L.G., 

RA Civerolo E.L., Simpson A.J.G., Almeida N.F. Jr., Setubal J.C., 

RA Kitajima J. P. ; 

RT "Comparative analyses of the complete genome sequences of Pierce's 

RT disease and citrus variegated chlorosis strains of Xylella 

RT fastidiosa."; 

RL J. Bacterid. 185:1018-1026(2003). 

DR EMBL; AE012560; AA029665.1; -. 

DR GO; GO: 0009341; C : beta-galactosidase complex; lEA. 

DR GO; GO: 0004565; F:beta-galactosidase activity; lEA. 

DR GO; GO: 0005975; P : carbohydrate metabolism; lEA. 

DR InterPro; IPR008979; Gal_bind_like . 

DR InterPro; IPR001944; Glyco_hydro_35 . 

DR Pfam; PF01301; Glyco_hydro_35 ; 1. 

DR PRINTS; PR00742; GLHYDRLASE35 . 

DR PROSITE; PS01182; GLYCOSYL_HYDROL_F35 ; 1. 

KW Complete proteome. 

SQ SEQUENCE 612 AA; 67992 MW; 2B66663591EB556B CRC64; 



PRT; 612 AA. 
Created) 

Last sequence update) 
Last annotation update) 



Query Match 92.9%; Score 39; DB 16; Length 612; 

Best Local Similarity 85.7%; Pred. No. 98; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 356 LQPPPLP 362 



RESULT 7 
Q9GRR6 

ID Q9GRR6 PRELIMINARY; PRT; 1404 AA. 

AC Q9GRR6; 

DT Ol-MAR-2001 (TrEMBLrel. 16, Created) 

DT Ol-MAR-2001 (TrEMBLrel. 16, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein (Fragment) . 

GN L5882.01. 

OS Leishmania major, 

OC Eukaryota; Euglenozoa; Kinetoplastida; Trypanosomatidae; Leishmania 

OX NCBI_TaxID=5664; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Friedlin; 

RA Masuy D., Purnelle B. , Goffeau A., Ivens A.C., Quail M. , 

RA Ra jandream M. A. , Barrell E.G.; 

RL Submitted (OCT-2000) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Friedlin; 

RX MEDLINE=9814 64 35; PubMed=9477 341; 

RA Ivens A.C., Lewis S.M., Bagherzadeh A., Zhang L., Chan H.M., 

RA Smith D. F. ; 

RT "A physical map of the Leishmania major Friedlin genome.*'; 

RL Genome Res. 8:135-145(1998). 

DR EMBL; AL445943; CAC14319.1; 

KW Hypothetical protein. 

FT NON_TER 1 1 

SQ SEQUENCE 1404 AA; 152867 MW; 2 5F42AB67 02B9D3C CRC64; 

Query Match 92.9%; Score 39; DB 5; Length 1404; 

Best Local Similarity 85.7%; Pred. No. 2.1e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 

Qy 1 MQPPPLP 7 

: I I I I I I 

Db 1142 LQPPPLP 1148 



RESULT 8 

Q9SGY5 

ID Q9SGY5 

AC Q9SGY5; 

DT Ol-MAY-2000 

DT 01-MAY-2000 

DT 01-JUN-2003 

DE F20B24.8. 



PRELIMINARY; 

(TrEMBLrel. 13, 
(TrEMBLrel. 13, 
(TrEMBLrel. 24, 



PRT; 



542 AA. 



Created) 

Last sequence update) 
Last annotation update) 



OS Arabidopsis thaliana (Mouse-ear cress) , 

OC Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta; 

OC Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicots; rosids; 

OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=37 02 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Shinn P., Brooks S,, Buehler E., Chao Q., Johnson-Hopson C, Khan S., 

RA Kim C, Altafi H., Bei Q., Chin C, Chiou J., Choi E., Conn L,, 

RA Conway A., Gonzales A., Hansen N., Rowing B. , Koo T., Lam B., Lee J., 

RA Lenz C, Li J., Liu A., Liu K. , Liu S., Mukharsky N.^ Nguyen M., 

RA Palm C, Pham P., Sakano H., Schwartz J., Southwick A., Thaveri A,, 

RA Toriumi M. , Vaysberg M., Yu G. , Federspiel N.A., Theologis A., 

RA Ecker J,R. ; 

RT "Genomic sequence for Arabidopsis thaliana BAG F20B24 from chromosome 

RT I . " ; 

RL Submitted (OCT-2000) to the EMBL/GenBank/DDBJ databases. 

CC -!- SIMILARITY: BELONGS TO FAMILY 28 OF GLYCOSYL HYDROLASES 

CC (POLYGALACTURONASES) . 

DR EMBL; AC009398; AAF17670.1; 

DR FIR; H86239; H86239. 

DR GO; GO: 0005618; Crcell wall; lEA. 

DR GO; GO: 0016798; Frhydrolase activity, acting on glycosyl bonds; lEA. 

DR GO; GO: 0004650; F : polygalacturonase activity; lEA. 

DR GO; GO:0005975; P: carbohydrate metabolism; lEA. 

DR InterPro; IPR000743; Glyco_hydro_28 . 

DR InterPro; IPR006626; PbHl . 

DR Pfam; PF00295; Glyco_hydro_28 ; 1. 

DR SMART; SM00710; PbHl; 4. 

DR PROSITE; PS00502; POLYGALACTURONASE; 1. 

KW Cell wall; Glycosidase; Hydrolase, 

SQ SEQUENCE 542 7\A; 59705 MW; 7C67D0C3BCD19BC8 CRC64; 

Query Match 90.5%; Score 38; DB 10; Length 542; 

Best Local Similarity 85.7%; Pred. No. 1.3e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I t I I I 
Db 105 VQPPPLP 111 



RESULT 9 
Q8TES7 



PRELIMINARY; 



ID Q8TES7 

AC Q8TES7; 

DT Ol-JUN-2002 (TrEMBLrel. 21, 

DT Ol-JUN-2002 (TrEMBLrel. 21, 

DT Ol-JUN-2002 (TrEMBLrel. 21, 

DE FLJ00103 protein (Fragment) . 

GN FLJ00103. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; 

OC Mammalia; Eutheria; Primates; 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 



PRT; 



678 AA. 



Created) 

Last sequence update) 
Last annotation update) 



Craniata ; Vertebrata ; Euteleostomi ; 
Catarrhini; Hominidae; Homo. 



RC TISSUE=Spleen; 

RA Ohara O., Nagase T., Kikuno R. , Okumura K. ; 

RT "The nucleotide sequence of a long cDNA clone isolated from human 

RT spleen. 

RL Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AK074045; BAB84871.1; -. 

FT NON_TER 1 1 

SQ SEQUENCE 678 AA; 70813 MW; 4 84 7E4 7E5 6F5D69 6 CRC64; 

Query Match 90.5%; Score 38; DB 4; Length 678; 

Best Local Similarity 85.7%; Pred. No. l,6e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 

Qy 1 MQPPPLP 7 

: I I I I I I 
Db 652 VQPPPLP 658 



RESULT 10 
Q8LIE4 

ID Q8LIE4 PRELIMINARY; PRT; 812 AA. 

AC Q8LIE4; 

DT Ol-OCT-2002 (TrEMBLrel. 22, Created) 

DT Ol-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE OJ1316_A04 .20 protein (P0427D10.3 protein). 

GN OJ1316_A04 .20 OR P0427D10.3. 

OS Oryza sativa (japonica cultivar-group) . 

OC Eukaryota; Viridiplantae ; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta ; Magnoliophyta ; Liliopsida; Poales; Poaceae; 

OC Ehrhartoideae; Oryzeae; Oryza. 

OX NCBI_TaxID=39947; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN==cv. Nipponbare; 

RA Sasaki T., Matsumoto T., Yamamoto K. ; 

RT "Oryza sativa nipponbare (GA3 ) genomic DNA, chromosome 1, BAG 

RT clone :OJ1316_A04. "; 

RL Submitted (JUL-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Nipponbare; 

RA Sasaki T., Matsumoto T., Yamamoto K. ; 

RT "Oryza sativa nipponbare ( GA3 ) genomic DNA, chromosome 7, PAC 

RT clone: P0427D10. "; 

RL Submitted (OCT~2001) to the EMBL/GenBank/DDBJ databases. 

DR EMBL; AP003822; BAC07001.1; 

DR EMBL; AP004272; BAC20038.1; -. 

DR Gramene; Q8LIE4; -. 

DR InterPro; IPR008938; ARM, 

SQ SEQUENCE 812 AA; 87347 MW; B58C5223F0385634 CRC64; 

Query Match 90.5%; Score 38; DB 10; Length 812; 

Best Local Similarity 85.7%; Pred. No. 1.9e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gap 



Qy 



1 MQPPPLP 7 



Db 26 VQPPPLP 32 



RESULT 11 




Q8T1F5 




ID 


Q8T1F5 PRELIMINARY; PRT; 2165 AA. 




AC 


Q8T1F5; 




DT 


Ol-JUN-2002 (TrEMBLrel. 21, Created) 




DT 


Ol-JUN-2003 (TrEMBLrel. 24, Last sequence update) 




DT 


Ol-OCT-2003 (TrEMBLrel. 2b, Last annotation update) 




DE 


Similar to Dictyostelium discoideum (Slime mold) . protein tyrosine 


DE 


kinase . 




OS 


Dictyostelium discoideum (Slime mold) . 




OC 


Eukaryota; Mycetozoa; Dictyos teliida ; Dictyostelium. 




OX 


NCBI TaxID=44689; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=AX4; 




RX 


MEDLINE=22092622; PubMed=12097 910; 




RA 


Gloeckner G. , Eichinger L., Szafranski K. , Pachebat J., 


Dear P., 


RA 


Lehmann R., Baumgart C, Parra G., April J.F., Guigo R., 


Kump f K . , 


RA 


Tunggal B., Cox E., Quail M.A., Platzer M., Rosenthal A. 


, Noegel A 


RT 


"Sequence and analysis of chromosome 2 of Dictyostelium discoideum 


RL 


Nature 418:79-85(2002) . 




RN 


[2] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=AX4; 




RA 


Baumgart C. ; 




RL 


Submitted (MAR-2003) to the EMBL/GenBank/DDBJ databases. 




DR 


EMBL; AC116959; AAM08443.2; -. 




DR 


GO; GO: 0005524; FrATP binding; lEA. 




DR 


GO; GO: 0004674; F;protein serine/threonine kinase activity; lEA. 


DR 


GO; GO: 0004713; F: protein-tyrosine kinase activity; lEA. 




DR 


GO; GO: 0007242; P : intracellular signaling cascade; lEA. 




DR 


GO; GO: 0006468; P;protein amino acid phosphorylation; lEA. 


DR 


InterPro; IPR001849; PH. 




DR 


InterPro; IPR000719; Prot_kinase. 




DR 


InterPro; IPR001683; PX. 




DR 


InterPro; IPR002290; Ser_thr pkinase. 




DR 


InterPro; IPR008271; Ser_thr_pkin_AS , 




DR 


InterPro; IPR001245; Tyr_pkinase. 




DR 


PRINTS; PR00109; TYRKINASE. 




DR 


ProDom; PDOOOOOl; Prot_kinase; 1. 




DR 


SMART; SM00220; S_TKc; 1. 




DR 


SMART; SM00219; TyrKc; 1. 




DR 


PROSITE; PS50003; PH_DOMAIN; 1. 




DR 


PROSITE; PS00107; PROTEIN_KINASE_ATP ; 1. 




DR 


PROSITE; PS50011; PROTEIN KINASE DOM; 1. 




DR 


PROSITE; PS00108; PROTEIN KINASE ST; 1. 




DR 


PROSITE; PS50195; PX; 1. 




KW 


Kinase . 




SQ 


SEQUENCE 2165 AA; 241910 MW; AFFF8E5B1C32202F CRC64; 





Query Match 90.5%; Score 38; DB 5; Length 2165; 

Best Local Similarity 85.7%; Pred. No. 4.6e+02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 



Qy 1 MQPPPLP 7 

: I I I I M 

Db 1284 IQPPPLP 1290 



RESULT 12 
Q8IC57 
ID Q8IC57 
Q8IC57; 
Ol-MAR-2 003 
Ol-MAR-2 0 03 
Ol-OCT-2003 



PRELIMINARY; 



PRT; 2879 AA. 



AC 
DT 
DT 
DT 
DE 
GN 
OS 
OC 
OX 
RN 
RP 
RA 
RA 
RA 
RL 
DR 
DR 
DR 
DR 
DR 
SQ 



Created) 

Last sequence update) 
Last annotation update) 
(PfEMPl) . 



(TrEMBLrel. 23, 
(TrEMBLrel. 23, 
(TrEMBLrel. 25, 
Erythrocyte membrane protein 1 
VAR. 

Plasmodium falciparum (isolate 3D7). 
Eukaryota; Alveolata; Apicomplexa; Haemosporida 
NCBI_TaxID-36329; 
[1] 

SEQUENCE FROM N.A. 
Cherevach I., Davis P 
Berriman M, , Pain A. , 
Ormond D., Sanders M. 
Submitted (SEP-2002) 



Plasmodium, 



Goodhead I., Stevens K. , Mungall K., 
Hall N., Atkin R. , Chillingworth C, Doggett J., 
, Hayes R. , Hall S., Quail M., Barrell B.; 
to the EMBL/GenBank/DDBJ databases. 



EMBL; AL844505; CAD50587.1; -. 

GO; GO: 0005539; F : glycosaminoglycan binding; lEA. 
GO; GO: 0009405; P : pathogenesis ; lEA. 
InterPro; IPR004258; PFEMP. 
Pfam; PF03011; PFEMP; 1. 

SEQUENCE 2879 AA; 333477 MW; 6A2EEACE2C92C6C1 CRC64 ; 



Query Match 90.5%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 38; DB 5; Length 2879; 
Pred. No. 6e+02; 
1; Mismatches 0; Indels 0; 



Gaps 



0; 



Qy 1 MQPPPLP 7 

: I I I I I I 

Db 2429 VQPPPLP 2435 



RESULT 13 


Q85G37 


ID 


Q85G37 PRELIMINARY; PRT; 44 AA. 


AC 


Q85G37; 


DT 


Ol-JUN-2003 (TrEMBLrel. 24, Created) 


DT 


Ol-JUN-2003 (TrEMBLrel. 24, Last sequence update) 


DT 


Ol-OCT-2003 (TrEMBLrel. 25, Last annotation update) 


DE 


ORF44 . 


OS 


Cyanidioschyzon merolae (Red alga) . 


OG 


Chloroplast . 


OC 


Eukaryota ; Rhodophyta ; Bangiophyceae ; Cyanidios chyzon . 


OX 


NCBI TaxID=45157; 


RN 


[1] 


RP 


SEQUENCE FROM N.A. 


RC 


STRAIN=10D; 


RX 


MEDLINE=22639682; PubMed-127 55171; 


RA 


Ohta N., Matsuzaki M. , Misumi 0., Miyagishima S., Nozaki H., 



RA Tanaka K. , Shin-i T., Kohara Y., Kuroiwa T. ; 

RT "Complete Sequence and Analysis of the Plastid Genome of the 

RT Unicellular Red Alga Cyanidioschyuzon merolae."; 

RL DNA Res. 10:67-77(2003). 

DR EMBL; AB002583; BAC76154.1; 

DR GO; GO: 0009507; C : chloroplas t ; lEA. 

KW Chloroplast . 

SQ SEQUENCE 44 AA; 4903 MW; A38CAD0D007AA7B0 CRC64; 

Query Match 88.1%; Score 37; DB 8; Length 44; 

Best Local Similarity 100.0%; Pred. No. 18; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 

Qy 2 QPPPLP 7 

I I I I I I 

Db 27 QPPPLP 32 



RESULT 14 




Q87610 




ID 


Q87610 PRELIMINARY; PRT; 72 AA. 




AC 


Q87610; 




DT 


Ol-NOV-1996 (TrEMBLrel. 01, Created) 




DT 


Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 




DT 


Ol-JUN-2003 (TrEMBLrel. 24, Last annotation update) 




DE 


Rev protein (Fragment) . 




GN 


REV. 




OS 


Chimpanzee immunodeficiency virus (SIV(cpz)) (CIV). 




OC 


Viruses; Retroid viruses; Retroviridae; Lentivirus . 




OX 


NCBI TaxID=11723; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=P031; 




RX 


MEDLINE=97138 325; PubMed=8 9 85351 ; 




RA 


Bibollet-Ruche F. , Brengues C, Galat-Luong A., Galat 


G. , Pourrut X 


RA 


Vidal N., Veas F., Durand J. P., Cuny G. ; 




RT 


"Genetic diversity of simian immunodeficiency viruses 


from West 


RT 


African green monkeys : evidence of multiple genotypes 


within 


RT 


populations from the same geographical locale."; 




RL 


J. Virol. 71:307-313(1997). 




DR 


EMBL; U37213; AAC56168.1; 




DR 


GO; GO: 0042025; C:host cell nucleus; lEA. 




DR 


GO; GO: 0003700; F : transcription factor activity; lEA. 




DR 


GO; GO: 0006355; P:regulation of transcription, DNA-dependent ; lEA. 


DR 


InterPro; IPR000625; REV protein. 




DR 


Pfam; PF00424; REV; 1. 




FT 


NON_TER 1 1 




FT 


NON TER 72 72 




SQ 


SEQUENCE 72 AA; 8188 MW; B7A8 027FA9BA432A CRC64; 





Query Match 88.1%; Score 37; DB 15; Length 72; 

Best Local Similarity 100.0%; Pred. No. 29; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 



Qy 2 QPPPLP 7 

I I I I I I 

Db 58 QPPPLP 63 



RESULT 15 




Q87603 




ID 


Q87603 PRELIMINARY; PRT; / AA. 




AC 


Q87603; 




DT 


Ol-NOV-1996 (TrEMBLrel. 01, Createa) 




DT 


Ol-NOV-1996 (TrEMBLrel. 01, Last sequence update) 




DT 


Ol-JUN-2003 (TrEMBLrel. 24, Last annotation update) 




DE 


Rev protein (Fragment) . 




GN 


REV. 




OS 


Chimpanzee immunodeficiency virus (SIV(cpz)) (CIV). 




oc 


Viruses; Retroid viruses; Retroviridae; Lentivirus. 




ox 


NCBI_TaxID=1172 3; 




RN 


[1] 




RP 


SEQUENCE FROM N.A. 




RC 


STRAIN=P058; 




RX 


MEDLINE-97 138325; PubMed=8 98 5351 ; 




RA 


Bibollet-Ruche F. , Brengues C, Galat-Luong A., Galat 


G. , Pourrut X 


RA 


Vidal N., Veas F., Durand J. P., Cuny G. ; 




RT 


"Genetic diversity of simian immunodeficiency viruses 


from West 


RT 


African green monkeys: evidence of multiple genotypes 


within 


RT 


populations from the same geographical locale."; 




RL 


J. Virol. 71:307-313(1997). 




DR 


EMBL; U37210; AAC56159.1; -. 




DR 


GO; GO: 0042025; C:host cell nucleus; lEA. 




DR 


GO; GO: 0003700; F: transcription factor activity; lEA. 




DR 


GO; GO: 0006355; P: regulation of transcription, DNA-dependent ; lEA. 


DR 


InterPro; IPR000625; REV_protein. 




DR 


Pfam; PF00424; REV; 1. 




FT 


NON TER 1 1 




SQ 


SEQUENCE 73 AA; 8217 MW; A3 1FDD937DC9BB4D CRC64 ; 





Query Match 88.1%; Score 37; DB 15; Length 73; 

Best Local Similarity 100.0%; Pred. No. 29; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gap 



Qy 2 QPPPLP 7 

I I M I I 

Db 58 QPPPLP 63 



Search completed: August 24, 2004, 15:50:24 
Job time : 27.6269 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table : 



August 24, 2004, 14:57:04 ; Search time 3.76119 Seconds 

(without alignments) 
96.908 Million cell updates/sec 

US-09-641-801-1 
42 

1 MQPPPLP 7 
BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database 



SwissProt 42:*^ 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 



Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


39 


92. 


,9 


222 


1 


COLI_THUOB 


Q9ygk2 


t corticotr 


2 


39 


92, 


. 9 


701 


1 


CG1_HUMAN 


Q13495 


homo sapien 


3 


39 


92, 


, 9 


1074 


1 


EMl2_MOUSE 


Q8k482 


mus musculu 


4 


38 


90. 


, 5 


536 


1 


ARP_ARATH 


P45951 


arabidopsis 


5 


37 


88. 


, 1 


167 


1 


SERO_GALME 


076192 


galleria me 


6 


37 


88. 


, 1 


211 


1 


RGM1_YEAST 


Q00453 


saccharomyc 


7 


37 


88, 


.1 


278 


1 


TNF6_RAT 


P36940 


rattus norv 


8 


37 


88. 


.1 


330 


1 


PEXD_CAEEL 


Q19951 


caenorhabdi 


9 


37 


88. 


.1 


393 


1 


WR25_ARATH 


022921 


arabidopsis 


10 


37 


88. 


.1 


483 


1 


B1AR__MELGA 


P07700 


meleagris g 


11 


37 


88. 


.1 


562 


1 


ILVD_PYRAE 


Q8zyu6 


pyrobaculum 


12 


37 


88. 


. 1 


620 


1 


EXTN TOBAC 


P13983 


nicotiana t 


13 


37 


88. 


. 1 


649 


1 


DREB_HUMAN 


Q16643 


homo sapien 


14 


37 


88. 


.1 


745 


1 


AXN DROME 


Q9v407 


drosophila 


15 


37 


88. 


.1 


830 


1 


DYNl^CAEEL 


P39055 


caenorhabdi 


16 


37 


88, 


.1 


1027 


1 


DOCl_MOUSE 


Q8bur4 


mus musculu 


17 


37 


88, 


.1 


1032 


1 


S24D HUMAN 


094855 


homo sapien 



18 


37 


88 . 


1 


1124 


1 


i Or 0 hUNAlN 


D "5 ^7 0 7 ^ 


homo S3pisn 


19 


37 


88 . 


1 


1557 


1 


TTTvjTKT O T\;r/^T T O 'C 

r MN Z JxlO U b hj 


nQ ^ 1 r\ A 
y y J X u 


mus inusculu 


20 


37 


8 8 . 


1 


3312 


1 


CLR3 HUMAN 


yynyq / 


homo sHpien 


21 


36 


85 . 


7 


463 


1 


YAD4 YEAST 


T)0 Q r\ r\ o 
rZ 0 U Uo 


saccharomyc 


22 


36 


85 . 


7 


505 


1 


WASL BOVIN 


yyolu / 


bos taurus 


23 


36 


85 . 


7 


62 9 


1 


RA21 XENLA 


uy o o 1 u 


xenopus lae 


24 


36 


85 . 


7 


736 


1 


DVIj2 XENLA 




xenopus lae 


25 


36 


85 . 


7 


823 


1 


NUCl NEUCR 


D o n Q O /I 


neurospora 


26 


36 


85 . 


7 


1484 


1 


CES2 HUMAN 


QyjDXt O 


homo sapien 


27 


35 


83 . 


3 


175 


1 


OE55 NPVHZ 




heliothis z 


28 


35 


83 . 


3 


414 


1 


PRS6 CAEEL 




caenorhabdi 


29 


35 


83 . 


3 


533 


1 


MUTL THEAQ 


T~i n n o o 
Py DUOZ 


thermus aqu 


30 


35 


83 . 


3 


58 7 


1 


ACJ\Z YEAb 1 


D /l fl R R 
r 4 U OoD 


s accharomyc 


31 


35 


83 . 


3 


58 8 


1 


DABl MOUSE 


o Q 1 Q 1 Q 

r y / o i o 


mus musculu 


32 


35 


83 . 


3 


639 


1 


SFOl HUMAN 




homo sapien 


33 


35 


83 . 


3 


653 


1 


SEOl MOUSE 




mus musculu 


34 


35 


83 . 


3 


666 


1 


*[~>/^TTT\ TTT7>T HvT/^ 

BCHD HELMO 


("1 r-T A £r 

y y zge o 


heliobacill 


35 


35 


83 . 


3 


808 


1 


YUdd NPVAC 


■n /I 1 /I C "7 
P4 1 4 D / 


autographa 


36 


35 


83 . 


3 


1271 


1 


RBMG HUMAN 


Q9upn6 


homo sapien 


37 


35 


83 . 


3 


2476 


1 


ATRX MOUSE 


Q61687 


mus musculu 


38 


35 


83 . 


3 


2492 


1 


ATRX HUMAN 


P4 510 0 


homo sapien 


39 


35 


83 . 


3 


2492 


1 


ATRX_P7\NTR 


Q7yqra4 


pan troglod 


40 


35 


83. 


3 


2492 


1 


ATRX PONPY 


Q7yqin3 


pongo pygma 


41 


34 


81. 


0 


91 


1 


Y87 9_MYCTU 


Q10541 


mycobacteri 


42 


34 


81. 


0 


193 


1 


PGCl PIG 


Q95250 


sus scrofa 


43 


34 


81. 


0 


194 


1 


PGC1_HUMAN 


000264 


homo sapien 


44 


34 


81. 


0 


194 


1 


PGCl_MOUSE 


055022 


mus musculu 


45 


34 


81. 


0 


194 


1 


PGCl RAT 


P70580 


rattus norv 



ALIGNMENTS 



RESULT 1 
COLI_THUOB 

ID COLI__THUOB STANDARD; PRT; 222 AA. 

AC Q9YGK2; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Corticotropin-lipotropin precursor ( Pro-opiomelanocortin) (POMC) 

DE [Contains: Corticotropin (Adrenocorticotropic hormone) (ACTH) ; 

DE Melanotropin alpha (Alpha-MSH) ; Corticotropin-like intermediary 

DE peptide (CLIP) ; Lipotropin beta (Beta-LPH) ; Lipotropin gamma (Gamma- 

DE LPH) ; Melanotropin beta (Beta-MSH) ; Beta-endorphin; Met-enkephalin] . 

OS Thunnus obesus (Bigeye tuna) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Actinopterygii ; Neopterygii; Teleostei; Euteleostei; Neoteleostei ; 

OC Acanthomorpha; Acanthopterygii ; Percomorpha; Perciformes; Scombroidei; 

OC Scombridae; Thunnus. 

OX NCBI_TaxID=8241; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Pituitary; 

RA Amemiya Y., Takahashi A., Kawauchi H. ; 

RT "Tuna proopiomelanocortin cDNA."; 

RL Submitted (DEC-1998) to the EMBL/GenBank/DDBJ databases. 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



PTM: Specific enzymatic cleavages at paired basic residues yield 
the different active peptides. 
SIMILARITY: Belongs to the POMC family. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http : //www. isb-sib , ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AB02 0 
InterPro; I 
Pfam; PF009 
PRINTS; PRO 
ProDom; PDO 
Endorphin 
SIGNAL 
PEPTIDE 
PEPTIDE 
PEPTIDE 
PEPTIDE 
PEPTIDE 
PEPTIDE 
PEPTIDE 
PEPTIDE 
DOMAIN 
DOMAIN 
SEQUENCE 



971; BAA35125.1; -. 
PR001941; Mcortin_ACTH. 
76; ACTH_domain; 1. 
0383; MELANOCORTIN. 
03250; Mcortin_ACTH; 1. 

Hormone; Cleavage on pair of basic residues; Signal. 



1 

93 
93 
111 
136 
136 
172 
191 
191 
71 
80 
222 



AA; 



18 POTENTIAL. 

132 CORTICOTROPIN. 

107 MELANOTROPIN ALPHA. 

132 CORTICOTROPIN-LIKE INTERMEDIARY PEPTIDE. 

222 LIPOTROPIN BETA. 

188 LIPOTROPIN GAMMA. 

18 8 MELANOTROPIN BETA. 

222 BETA- ENDORPHIN. 

195 MET-ENKEPHALIN. 

75 POLY-SER. 

87 POLY-SER. 

24970 MW; 7 868C7 13D6360B70 CRC64; 



Query Match 92.9%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 39; DB 1; Length 222; 
Pred. No. 23; 
1; Mismatches 0; Indels 



0 ; Gaps 



0; 



Qy 

Db 



1 MQPPPLP 7 

: M I I I I 
62 LQPPPLP 6^ 



RESULT 2 
CG1_HUMAN 

ID CG1_HUMAN STANDARD; PRT; 7 01 AA. 

AC Q13495; 

DT Ol-NOV-1997 (Rel. 35, Created) 

DT Ol-NOV-1997 (Rel. 35, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE CGI protein (F18) . 

GN CX0RF6 OR CGI. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI__TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96225444; PubMed=8 64 0223 ; 

RA Laporte J., Hu L.-J., Kretz C, Mandel J.-L., Kioschis P., Coy J., 



RA 
RT 
RT 
RL 
CC 

cc 

CC 

cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
FT 
FT 
FT 
SQ 



Klauck S.M., Poutska A., Dahl N.; 

"A gene mutated in X-linked myotubular myopathy defines a new 
putative tyrosine phosphatase family conserved in yeast."; 
Nat. Genet. 13:175-182(1996). 

TISSUE SPECIFICITY: Preferentially expressed in skeletal muscle. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; U46023; AAC50551.1; 
Genew; HGNC:2568; CXorf6. 
MIM; 300120; -. 
DOMAIN 275 
DOMAIN 419 
DOMAIN 522 
SEQUENCE 701 AA; 



286 
429 
533 
74475 MW; 



POLY- PRO. 
POLY-GLN. 
POLY-GLN. 

AD1C18E4F6DFD34E CRC64; 



Query Match 92.9%; 
Best Local Similarity 85.7%; 
Matches 6; Conservative 



Score 39; DB 1; 
Pred. No. 77; 

1; Mismatches 



Length 7 01; 
0; Indels 



0; Gaps 



0; 



Qy 



Db 



1 MQPPPLP 7 

: I I I I I I 
273 LQPPPLP 279 



RESULT 3 
EMI2_M0USE 

ID EMI2_M0USE STANDARD; PRT; 1074 AA. 

AC Q8K482; 

DT lO-OCT-2003 (Rel. 42, Created) 

DT lO-OCT-2003 (Rel. 42, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE EMILIN 2 precursor (Elastin microfibril interface-located protein 2) 

DE (Elastin microfibril interfacer 2) (Basilin) . 

GN EMILIN2. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleos tomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A., AND CHARACTERIZATION. 

RC STRAIN=C57BL/6J; TISSUE=Cochlea; 

RX MEDLINE=22722101; PubMed=12 837 62 9 ; 

RA Amma L.L., Goodyear R. , Faris J.S., Jones I., Ng L., Richardson G. , 

RA Forrest D. ; 

RT "An emilin family extracellular matrix protein identified in the 

RT cochlear basilar membrane."; 

RL Mol. Cell. Neurosci. 23:460-472(2 003). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Lirab; 



RX MEDLINE=22388257; PubMed=12477 932 ; 

RA Strausberg R.L.;. Feingold E.A. ^ Grouse Derge J.G,, 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul Zeeberg B. , Buetow Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L, , Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C., 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J,, McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15, 000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2 002). 

RN [3] 

RP DEVELOPMENTAL STAGE. 

RX MEDLINE-222 09197; PubMed=1222 1002 ; 

RA Leimeister C., Steidl C., Schumacher N., Erhard S., Gessler M. ; 

RT "Developmental expression and biochemical characterization of Emu 

RT family members . " ; 

RL Dev. Biol. 249:204-218(2002). 

CC -!- FUNCTION: May be responsible for anchoring smooth muscle cells to 

CC elastic fibers, and may be involved not only in the formation of 

CC the elastic fiber, but also in the processes that regulate vessel 

CC assembly. Has cell adhesive capacity. Major component of the 

CC cochlear basilar membrane (BM) which may contribute to the 

CC developmental assembly or function of the BM. 

CC -!- SUBUNIT: Homotrimer associated through a moderately stable 

CC interaction of the C-terminal globular Clq domains, allowing the 

CC nucleation of the triple helix and then a further quaternary 

CC assembly to higher order polymers via intermolecular disulfide 

CC bonds (By similarity). Interacts with EMILINl. 

CC -!- SUBCELLULAR LOCATION: Extracellular matrix. Found mainly at the 

CC interface between amorphous elastin and microfibrils. 

CC -!- TISSUE SPECIFICITY: Highest levels are present in cochlea of P8 

CC pups, followed by modest levels in adult heart and lung, and much 

CC lower levels in forebrain, brainstem, cerebellum and hypothalamus. 

CC Very low levels detected in muscle, liver, kidney and eye. 

CC -!- DEVELOPMENTAL STAGE: Low levels detected in cochlea in neonatal 

CC pups at PI. Levels increased 2-fold by P5 and rose further to 16- 

CC fold at P13. Expression declined somewhat in adult mice. At E9.5, 

CC as during all stages of development, it is strongly expressed in 

CC the neural fold, the limbbuds and the heart. 

CC -!- SIMILARITY: Contains 1 EMI domain. 

CC -!- SIMILARITY: Contains 1 collagenous domain. 

CC -!- SIMILARITY: Contains 1 CIQ domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute, There are no restrictions on its 



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF468645; AAM53532.1; 

DR EMBL; BC053753; AAH53753.1; 

DR MGD; MGI: 2389136; Emilin2 . 

DR InterPro; IPR001073; Clq. 

DR InterPro; IPR008983; TNF_like. 

DR Pfam; PF00386; Clq; 1. 

DR SMART; SMOOllO; ClQ; 1. 

KW Cell adhesion; Extracellular matrix; Signal; Coiled coil; Collagen; 
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(POTENTIAL) . 


FT 
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(POTENTIAL) . 


FT 
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514 


514 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) . 


FT 


CARBOHYD 


586 
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N-LINKED 


( GLCNAC . . . ) 


(POTENTIAL) . 


FT 


CARBOHYD 
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615 


N-LINKED 


( GLCNAC . . . ) 


(POTENTIAL) . 


FT 


CARBOHYD 


744 


744 


N-LINKED 


(GLCNAC. . .) 


(POTENTIAL) . 


FT 


CARBOHYD 


995 


995 


N-LINKED 


( GLCNAC . . . ) 


(POTENTIAL) . 


SQ 


SEQUENCE 


1074 


AA; 117310 


MW; 4B81728C83CE52A7 CRC64; 



Query Match 92.9%; Score 39; DB 1; Length 1074; 

Best Local Similarity 85.7%; Pred. No. 1.2e-l-02; 

Matches 6; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MQPPPLP 7 

: I I I I I I 

Db 842 LQPPPLP 848 



RESULT 4 
ARP_ARATH 

ID ARP_ARATH STANDARD; PRT ; 536 AA. 

AC P45951; 

DT Ol-NOV-1995 (Rel. 32, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Apurinic endonuclease-redox protein ( DNA- (apurinic or apyrimidinic 

DE site) lyase) (EC 4 . 2 . 99 . 18 ) . 

GN ARP OR REF OR AT2G41460 OR T26J13.5. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicots; rosids; 



OC eurosids II; Brassicales; Brassicaceae; Arabidopsis. 

OX NCBI_TaxID=3702; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN^cv. Columbia; 

RX MEDLINE=20083487; PubMed=l 0617 197 ; 

RA Lin X., Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town CD., 

RA Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V., 

RA Buell C.R., Ketchum K.A. , Lee J. J., Ronning CM., Koo H.L., 

RA Moffat K.S., Cronin L.A., Shen M. , Pai C, Van Aken S., Umayam L., 

RA Tallon L.J., Gill J.E., Adams M.D., Carrera A.J., Creasy T.H., 

RA Goodman H.M., Somerville CR., Copenhaver G.P., Preuss D., 

RA Nierman W.C, White O., Eisen J.A., Salzberg S.L., Eraser CM., 

RA Venter J.C ; 

RT "Sequence and analysis of chromosome 2 of the plant Arabidopsis 

RT thaliana."; 

RL Nature 402:761-768(1999). 

RN [2] 

RP SEQUENCE OF 10-536 FROM N.A. 

RC STRAIN=cv. Columbia; TISSUE=Callus ; 

RX MEDLINE=94211851; PubMed=7512729; 

RA Babiychuk E., Kushnir S., van Montagu M. , Inze D.; 

RT "The Arabidopsis thaliana apurinic endonuclease Arp reduces human 

RT transcription factors Fos and Jun."; 

RL Proc. Natl. Acad. Sci. U.S.A. 91:32 99-3303(1994). 

CC -!- FUNCTION: Repairs oxidative DNA damages, seems also to act as a 

CC redox factor. Is multifunctional and may be involved both in DNA 

CC repair and in the regulation of transcription. 

CC -!- CATALYTIC ACTIVITY: The C-O-P bond 3' to the apurinic or 

CC apyrimidinic site in DNA is broken by a beta-elimination reaction, 

CC leaving a 3 ' -terminal unsaturated sugar and a product with a 

CC terminal 5 * -phosphate . 

CC -!- SUBCELLULAR LOCATION: Nuclear. 

CC TISSUE SPECIFICITY: Expressed in the siliques, flowers, and stems. 

CC A high level expression is seen in the leaves. 

CC -!- SIMILARITY: Belongs to the DNA repair enzymes AP/exoA family. 

CC -!- SIMILARITY: Contains 1 SAP domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AC004625; 7\AC23731.1; -. 

DR EMBL; X7 6912; CAA54234.1; -. 

DR PIR; T02441; T02441. 

DR HSSP; P27695; 1E9N. 

DR InterPro; IPR000097; APendonclsel . 

DR InterPro; IPR005135; Exo_endo_phos . 

DR InterPro; IPR004B08; ExoIII_xth. 

DR InterPro; IPR003034; SAP. 

DR Pfam; PF03372; Exo_endo_phos ; 1. 

DR Pfam; PF02037; SAP; 1. 

DR SMART; SM00513; SAP; 1. 



DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



1. 
1. 
1. 



TIGRFAMs; TIGR00195; exoDNase_I II ; : 
TIGRFAMs; TIGR00633; xth; 1. 
PROSITE; PS 0 072 6; AP_NUCLEASE_F1_1 ; 
PROSITE; PS00727; AP_NUCLEASE_F1_2 ; 
PROSITE; PS00728; AP_NUCLEASE_F1_3 ; 
PROSITE; PS50800; SAP; 1. 
DMA repair; Lyase; Nuclear protein. 

DOMAIN 1 278 HIGHLY CHARGED; INCREASES THE AFFINITY OF 

ARP FOR DNA. 
DOMAIN 97 131 SAP. 

DOMAIN 27 9 536 AP ENDONUCLEASE . 

METAL 313 313 MAGNESIUM OR MANGANESE (BY SIMILARITY) . 

ACT_SITE 527 527 GENERAL BASE (BY SIMILARITY) . 

SEQUENCE 536 AA; 60260 MW; 5C1FC17EA991D2 7B CRC64; 



Query Match 90.5%; Score 38; DB 1; Length 536; 

Best Local Similarity 85.7%; Pred. No. 81; 

Matches 6; Conservative 1; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 



Db 



1 MQPPPLP 7 
I : I I I I I 
264 MRPPPLP 270 



RESULT 5 
SERO__GALME 

ID SERO_GALME STANDARD; PRT; 167 AA. 

AC 07 6192; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Seroin precursor (Silk 23 kDa glycoprotein). 

OS Galleria mellonella (Wax moth) , 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Pyraloidea; 

OC Pyralidae; Galleriinae; Galleria. 

OX NCBI_TaxID=7137; 

RN [1] 

RP SEQUENCE FROM N.A., AND SEQUENCE OF 18-31. 

RC TISSUE=Silk gland; 

RX MEDLINE=98288272; PubMed=9624 12 6 ; 

RA Zurovec M. , Yang C, Kodrik D., Sehnal F. ; 

RT "Identification of a novel type of silk protein and regulation of its 

RT expression."; 

RL J. Biol. Chem. 273:15423-15428(1998), 

CC -!- SUBCELLULAR LOCATION: Secreted. 

CC -!- TISSUE SPECIFICITY: PRODUCED BY BOTH THE POSTERIOR (PSG) AND 
CC MIDDLE (MSG) SECTIONS OF SILK GLANDS. 

CC -!- DEVELOPMENT7VL STAGE: Seroin mRNA is high in the silk glands of 
CC feeding larvae, declines at ecdysis, reaches a maximum during 

CC cocoon spinning, and thereafter rapidly drops to an undetectable 

CC level. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 



CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF009828; AAC25171.1; -. 

KW Silk; Glycoprotein; Signal; Repeat. 
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(GLCNAC. . .) (POTENTIAL) 


FT CARBOHYD 


146 
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N-LINKED 


(GLCNAC. . .) (POTENTIAL) 


SQ SEQUENCE 


167 AA; 


18088 


MW; 2 7A6ABE 


862774EB9 CRC64; 


Query Match 




88.1 


%; Score 37; 


DB 1; Length 167; 


Best Local Similarity 


100. 


0%; Pred. No 


. 34; 


Matches 6; 


Conservative 


0; Mismatches 0; Indels 0; 



Qy 2 QPPPLP 7 

I I I I I I 

Db 42 QPPPLP 47 



RESULT 6 
RGM1_ YEAST 

ID RGM1_YEAST STANDARD; PRT; 211 AA. 

AC Q00453; 

DT Ol-APR-1993 (Rel. 25, Created) 

DT Ol-OCT-1996 (Rel. 34, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE Probable transcription repressor protein RGMl . 

GN RGMl OR YMR182C OR YM8010.12C. 

OS Saccharomyces cerevisiae (Baker's yeast). 

OC Eukaryota; Fungi; Ascomycota; Saccharomycotina; Saccharomycetes ; 

OC Saccharomycetales ; Saccharomycetaceae; Saccharomyces. 

OX NCBI_TaxID=4932; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c; 

RX MEDLINE=9202 0118 ; PubMed===1923755 ; 

RA Estruch F. ; 

RT "The yeast putative transcriptional repressor RGMl is a proline-rich 

RT zinc finger protein."; 

RL Nucleic Acids Res. 19:4 873-4 877(1991). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=S288c / AB972; 

RX MEDLINE=97313268; PubMed-9169 872 ; 

RA Bowman S., Churcher CM., Badcock K., Brown D., Chillingworth T., 

RA Connor R., Dedman K., Devlin K., Gentles S., Hamlin N. , Hunt S., 

RA Jagels K., Lye G., Moule S., Odell C, Pearson D., Rajandream M.A., 

RA Rice P., Skelton J., Walsh S., Whitehead S., Barrell B.G.; 

RT "The nucleotide sequence of Saccharomyces cerevisiae chromosome 

RT XIII. "; 



RL 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 



Nature 387:90-93(1997). 

SUBCELLULAR LOCATION: Nuclear. 

INDUCTION: UNDER THE CONTROL OF THE INDUCIBLE GALIO PROMOTER. 
DOMAIN: THE PRO-RICH REGION OF RGMl ATTACHED TO A HETEROLOGOUS DNA 
BINDING DOMAIN IS ABLE TO REPRESS THE EXPRESSION OF THE TARGET 
GENE. 

-!- SIMILARITY: Contains 2 C2H2-type zinc fingers. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; X59861; CAA42521.1; -. 
EMBL; Z49808; CAA89915.1; ~. 
PIR; S55129; S55129. 
HSSP; P07248; 2ADR. 
GermOnline; 142856; 
TRANSFAC; T00730; 
SGD; S0004794; RGMl. 
GO; GO: 0005634; C:nucleus; IC. 



GO; GO: 0003702; F:RNA polymerase II transcription factor acti . 

GO; GO: 0000122; P:negative regulation of transcription from P. 

InterPro; IPR007087; Znf_C2H2. 

Pfam; PF00096; zf-C2H2; 2. 

SMART; SM00355; ZnF_C2H2 ; 2. 

PROSITE; PS00028; ZINC_FINGER_C2H2_1 ; 1. 

PROSITE; PS50157; ZINC_FINGER_C2H2_2 ; 2. 

Transcription regulation; Repressor; DNA-binding; Zinc-finger; 
Metal-binding; Nuclear protein; Repeat. 
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FT 


DOMAIN 
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PRO-RICH. 


FT 


CONFLICT 


114 


114 


A -> V (IN REF. 1) . 


SQ 


SEQUENCE 


211 AA; 


23855 


MW; FADCE4757FE7D372 CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 211; 

Best Local Similarity 100.0%; Pred. No. 43; 

Matches 6; Conservative 0; Mismatches 0; Indels 



0; Gaps 



0; 



Qy 

Db 



2 QPPPLP 7 
I I I I I I 
154 QPPPLP 159 



RESULT 7 
TNF6_RAT 

ID TNF6_RAT STANDARD; PRT; 278 AA. 

AC P36940; 

DT Ol-JUN-1994 (Rel. 29, Created) 

DT Ol-JUN-1994 (Rel. 29, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Tumor necrosis factor ligand super family member 6 (FAS antigen 



DE ligand) . 

GN TNFSF6 OR FASL OR APTILGI . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Maimnalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI__TaxID-10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=94 084 7 92; PubMed=75052 05 ; 

RA Suda T., Takahashi T., Golstein P., Nagata S.; 

RT "Molecular cloning and expression of the Fas ligand^. a novel member 

RT of the tumor necrosis factor family."; 

RL Cell 75:1169-1178(1993). 

CC -!- FUNCTION: Cytokine that binds to TNFRSF6/FAS, a receptor that 
CC transduces the apoptotic signal into cells. May be involved in 

CC cytotoxic T cell mediated apoptosis and in T cell development. 

CC TNFRSF6/ FAS -mediated apoptosis may have a role in the induction of 

CC peripheral tolerance, in the antigen-stimulated suicide of mature 

CC T cells, or both. Binding to the decoy receptor TNFRSF6B/DcR3 

CC modulates its effects (By similarity) . 

CC SUBUNIT: Homotrimer (Probable). 

CC -!- SUBCELLULAR LOCATION: Type II membrane protein and secreted (By 
CC similarity) . 

CC -!- TISSUE SPECIFICITY: EXPRESSED IN ACTIVATED SPLENOCYTES AND 

CC THYMOCYTES, MODERATE OR WEAK EXPRESSION FOUND IN SMALL INTESTINES, 

CC KIDNEY AND LUNG. 

CC -!- INDUCTION: BY PMA/lONOMYCIN AND CONCAVALIN/ INTERLEUKIN-2 . 

CC -!- PTM: The soluble form derives from the membrane form by 
CC proteolytic processing (By similarity) . 

CC SIMILARITY: Belongs to the tumor necrosis factor family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U03470; AAC52129.1; -. 

DR PIR; A49266; A49266. 

DR HSSP; P01375; 4TSV. 

DR InterPro; IPR008064; Fas_ligand. 

DR InterPro; IPR006053; TNF_abc. 

DR InterPro; IPR006052; TNF_family. 

DR InterPro; IPR008983; TNF_like. 

DR InterPro; IPR003636; TNF_subf. 

DR Pfam; PF00229; TNF; 1. 

DR PRINTS; PR01681; FASLIGAND. 

DR PRINTS; PR01234; TNECROSISFCT . 

DR ProDom; PD002012; TNF_subf; 1. 

DR SMART; SM00207; TNF; 1. 

DR PROSITE; PS00251; TNF_1; 1. 

DR PROSITE; PS50049; TNF_2 ; 1. 

KW Cytokine; Apoptosis; Transmembrane; Glycoprotein; Signal-anchor. 

FT CHAIN 1 278 TUMOR NECROSIS FACTOR LIGAND SUPERFAMILY 
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SQ 


SEQUENCE 


278 AA; 


31140 


MW; 2898E18A862CEAC6 CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 278; 

Best Local Similarity 100.0%; Pred. No. 57; 

Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 2 QPPPLP 7 

I I I I I I 

Db 60 QPPPLP 65 



0; Gaps 



0; 



RESULT 8 
PEXD__CAEEL 

ID PEXD_CAEEL STT^DARD; PRT; 330 AA. 

AC Q19951; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Probable peroxisomal membrane protein PEX13 (Peroxin-13) . 

GN F32A5.6. 

OS Caenorhabditis elegans . 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 

OC Rhabditidae; Peloderinae; Caenorhabditis. 

OX NCBI_TaxID=6239; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Bristol N2 ; 

RA Pauley A. ; 

RL Submitted (MAR-1995) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP REVISIONS . 

RA Waterston R. ; 

RL Submitted (JUN-2001) to the EMBL/GenBank/DDBJ databases. 

CC -!- FUNCTION: Component of the peroxisomal translocation machinery 

CC with PEX14 and PEX17. Functions as a docking factor for the 

CC predominantly cytoplasmic PTSl receptor (PAS10/PEX5) (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. Peroxisomal (By 
CC similarity) . 

CC -!- SIMILARITY: Contains 1 SH3 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 



CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http : //www. isb-sib , ch/announce/ 

CC or send an email to license@isb-sib , ch) . 

CC _ 

DR EMBL; U20864; AAC46665.2; 

DR HSSP; P00519; lABL . 

DR WormPep; F32A5.6; CE2 82 90. 

DR InterPro; IPR007223; Peroxin-13_N . 

DR InterPro; IPR001452; SH3 . 

DR Pfam; PF04088; Peroxin-13_N; 1. 

DR Pfam; PF00018; SH3; 1. 

DR PRINTS; PR00452; SH3D0MAIN. 

DR ProDom; PD000066; SH3; 1. 

DR SMART; SM00326; SH3; 1. 

DR PROSITE; PS50002; SH3 ; 1. 

KW Peroxisome; Transport; Protein transport; Transmembrane; SH3 domain. 

FT DOMAIN 1 192 LUMENAL (POTENTIAL). 

FT TRANSMEM 193 213 POTENTIAL. 

FT DOMAIN 214 330 CYTOPLASMIC (POTENTIAL) . 

FT DOMAIN 235 30 0 SH3 . 

SQ SEQUENCE 330 AA; 35765 MW; 7DDD64 7 F07A04 4 03 CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 330; 

Best Local Similarity 100.0%; Pred. No. 68; 

Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 2 QPPPLP 7 

MINI 

Db 8 QPPPLP 13 



0; Gaps 



0; 



RESULT 9 
WR25_ARATH 

ID WR25__7VRATH STANDARD; PRT; 393 AA. 

AC 022 921; Q94AT4; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Probable WRKY transcription factor 25 (WRKY DNA-binding protein 25) . 

GN WRKY25 OR AT2G30250 OR T9D9.6. 

OS Arabidopsis thaliana (Mouse-ear cress) . 

OC Eukaryota; Viridiplantae ; Streptophyta ; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta; eudicotyledons ; core eudicots; rosids; 

OC eurosids II; Brassicales; Brassicaceae ; Arabidopsis. 

OX NCBI_TaxID=3702 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN^^cv. Columbia; TISSUE^Flower ; 

RA Ulker B., Kushnir S., Somssich I.E.; 

RT "Arabidopsis thaliana transcription factor WRKY25."; 

RL Submitted (SEP-2001) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=cv. Columbia; 

RX MEDLINE=20083487; PubMed=10617197 ; 

RA Lin X,, Kaul S., Rounsley S.D., Shea T.P., Benito M.-I., Town CD., 



RA 

RA 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

RN 

RP 

RC 

RX 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RA 

RT 

RT 

RL 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 

CC 



Fujii C.Y., Mason T.M., Bowman C.L., Barnstead M.E., Feldblyum T.V 
Buell C.R,, Ketchum K.A. , Lee J. J. , Ronning CM., Koo H.L., 
Moffat K.S., Cronin L.A., Shen M. , Pai G. , Van Aken S., Umayam L., 
Tallon L.J., Gill J.E., Adams M.D., Carrera A. J., Creasy T.H., 
Goodman H.M., Somerville C.R., Copenhaver G.P., Preuss D. , 
Nierman W.C., White 0., Eisen J. A., Salzberg S.L., Eraser CM., 
Venter J.C ; 

"Sequence and analysis of chromosome 2 of the plant Arabidopsis 
thaliana . " ; 

Nature 402:761-768(1999). 
[3] 

SEQUENCE FROM N.A. 
STRAIN=cv. Columbia; 
MEDLINE=22954850; PubMed=14593172 ; 
Yamada K. , Lim J., Dale J.M., Chen H 
Southwick A.M., Wu H.C, Kim CJ., 
Karlin-Newmann G., Liu S.X., Lam B 



Shinn P. , Palm C J. , 
Nguyen M. , Pham P.K., Cheuk R.F., 
, Sakano H., Wu T., Yu G., 



Miranda M. , Quach H.L., Tripp M. , Chang CH., Lee J.M., Toriumi M.J., 
Chan M.M., Tang CC, Onodera C.S., Deng J.M., Akiyama K. , Ansari Y. , 
Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P., 
Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M. , Hansen N.F., 
Hayashizaki Y., Johnson-Hopson C, Hsuan V.W., lida K. , Karnes M. , 
Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J., 
Kamiya A., Meyers C, Nakajima M., Narusaka M. , Seki M., Sakurai T., 
Satou M., Tamse R. , Vaysberg M., Wallender E.K,, Wong C, Yamamura Y., 
Yuan S., Shinozaki K. , Davis R.W., Theologis A., Ecker J.R.; 
"Empirical analysis of transcriptional activity in the Arabidopsis 
genome . " ; 

Science 302:842-846(2003). 

-!- FUNCTION: Transcription factor. Interacts specifically with the W 
box (5'-(T)TGAC(C/T)-3') , a frequently occurring elicitor- 
responsive cis-acting element (By similarity) . 
SUBCELLULAR LOCATION: Nuclear (Probable) . 
SIMILARITY: Belongs to the WRKY group I family. 
SIMILARITY: Contains 2 WRKY domains. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license{3isb-sib . ch) . 



CC 
DR 


EMBL; 


AF418309; 


AAL13040 


1 




DR 


EMBL; 


AC002338; 


AAC16930 


1 




DR 


EMBL ; 


AC004165; 


AAM14918 


1 




DR 


EMBL; 


AY062720; 


AAL32798 


1 




DR 


EMBL; 


AY114650; 


AAM47969 


1 




DR 


EMBL; 


AY136318; 


AAM96984 


1. 




DR 


EMBL; 


AY045813; 


AAK76487 


2, 




DR 


EMBL; 


BT002338; 


AAN86171 


1, 





DR PIR; T00575; T00575. 

DR InterPro; IPR003657; WRKY. 

DR Pfam; PF03106; WRKY; 2. 

DR PROSITE; PS50811; WRKY; 2. 

KW Transcription regulation; DNA-binding; Nuclear protein; Repeat. 



FT DNA_BIND 160 224 WRKY 1, 

FT DNA_BIND 322 387 WRKY 2. 

SQ SEQUENCE 393 AA; 44133 MW; 9057D65B69E12A0C CRC64; 

Query Match 88.1%; Score 37; DB 1; Length 393; 

Best Local Similarity 100.0%; Pred. No. 82; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 50 QPPPLP 55 



RESULT 10 
B1AR__MELGA 

ID B1AR_MELGA STANDARD; PRT; 483 7^. 

AC P07700; 

DT Ol-APR-1988 (Rel. 07, Created) 

DT Ol-APR-1988 (Rel. 07, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Beta-1 adrenergic receptor (Beta-T) . 

OS Meleagris gallopavo (Common turkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Galliformes; Phasianidae; Meleagris, 

OX NCBI_TaxID=9103; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=86313664 ; PubMed=301874 6; 

RA Yarden Y., Rodriguez H., Wong S.K.-F., Brandt D.R., May D.C., 

RA Burnier J., Harkins R.N., Chen E.Y., Ramachandran J., Ullrich A., 

RA Ross E.M. ; 

RT "The avian beta-adrenergic receptor: primary structure and membrane 

RT topology."; 

RL Proc. Natl. Acad. Sci. U.S.A. 83:6795-6799(1986). 

RN [2] 

RP STRUCTURE BY NMR OF 345-359. 

RX MEDLINE=95129696; PubMed=7828722 ; 

RA Jung H., Windhaber R. , Palm D. , Schnackerz K.D.; 

"NMR and circular dichroism studies of synthetic peptides derived 
from the third intracellular loop of the beta-adrenoceptor . " ; 

RL FEBS Lett. 358:133-136(1995). 

CC -!- FUNCTION: BETA-ADRENERGIC RECEPTORS MEDIATE THE CAT ECHO LAMINE- 
CC INDUCED ACTIVATION OF ADENYLATE CYCLASE THROUGH THE ACTION OF G 

CC PROTEINS. THIS RECEPTOR BINDS EPINEPHRINE AND NOREPINEPHRINE WITH 

CC APPROXIMATIVELY EQUAL AFFINITY. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC PTM: Homologous desensitization of the receptor is mediated by its 

OC phosphorylation by beta-adrenergic receptor kinase. 

CC -!- SIMILARITY: Belongs to family 1 of G-protein coupled receptors. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



RT 
RT 



cc 

no 
UK 


EMBL; M14379; AAA49627.1; 




UK 


PIR; A25896; A258 


96. 




HE) 

UK 


PDB; IDEP; 14-OCT 


-96. 




JJr\ 


InterPro; 


IPR000276; GPCR Rhodpsn. 


JJK 


Pfam; PFOOOOl; 7tm 1; 1. 




DR 


PRINTS; PR00237; 


GPCRRHODOPSN. 


UK 


PROSITE; 


PS00237; 


G_PROTEIN_ 


_RECEP_F1_1; 1. 


UK 


PROSITE; 


PS50262; 


G_PROTEIN_ 


'rECEP_F1_2 ; 1 . 


J\W 


G-protein 


coupled 


receptor; 


Transmembrane; Glycoprotein; 


n.vV 


Multigene 


family; 


Phosphorylation; Lipoprotein; Palmitate; 


J\W 


3D-structure . 






r i 




1 
j_ 


43 


EXTRACELLULAR (POTENTIAL) . 


Tjirri 

r 1 


i KAN bMhjM 


A A 

44 


6 / 


1 (POTENTIAL) . 


TTTt 

r i 




\J o 


11 


CYTOPLASMIC (POTENTIAL) . 


r 1 


TRANSMEM 


78 




2 ( POTENTIAL) . 


FT 


DOMAIN 


95 


115 


EXTRACELLULAR (POTENTIAL) . 


r 1 


TRANSMEM 


116 


137 


3 (POTENTIAL) . 


T7>rn 

r i 


DOMAIN 


138 


159 


CYTOPLASMIC (POTENTIAL) . 


T. '1 1 1 

r 1 


TRANSMEM 


160 


189 


4 (POTENTIAL) . 


FT 


DOMAIN 


190 


205 


EXTRACELLULAR (P0TENTI7VL) . 


T."P 

r 1 


TRANSMEM 


206 


228 


5 (POTENTIAL) . 


r 1 


DOMAIN 


229 


290 


CYTOPLASMIC (POTENTIAL) . 


FT 


TRANSMEM 


291 


315 


6 (POTENTIAL) . 


r 1 


DOMAIN 


316 


321 


EXTRACELLULAR (POTENTIAL) . 


FT 


TRTVNSMEM 


322 


344 


7 (POTENTIAL) . 


r J. 


DOMAIN 


345 


483 


CYTOPLASMIC (POTENTIAL) . 


FT 


CARBOHYD 


14 


14 


N-LINKED (GLCNAC. . .) (PROBABLE). 


FT 


DISULFID 


114 


192 


BY SIMILARITY. 


FT 


LIPID 


358 


358 


S-palmitoyl cysteine (By similarity; 


FT 


HELIX 


347 


358 


SQ 


SEQUENCE 


483 AA; 


54078 MW 


; B11A7E71F6CCE3E4 CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 483; 

Best Local Similarity 100.0%; Pred. No. le+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 256 QPPPLP 261 



RESULT 11 
ILVD_PYRAE 

ID ILVD_PYRAE STANDARD; PRT; 562 AA. 

AC Q8ZYU6; 

DT 28-FEB-2003 (Rel. 41, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Dihydroxy-acid dehydratase (EC 4.2.1.9) (DAD). 

GN ILVD OR PAE0615. 

OS Pyrobaculum aerophilum. 

OC Archaea; Crenarchaeota ; Thermoprotei ; Thermoproteales ; 

OC Thermoproteaceae; Pyrobaculum. 

OX NCBI_TaxID=13773; 
RN [1] 

RP SEQUENCE FROM N.A. 



RC STRAIN-IM2 / ATCC 51768 / DSM 7523; 
RX MEDLINE=21664397; PubMed-117 92 8 69 ; 

RA Fitz-Gibbon S.T., Ladner H., Kim U.-J., Stetter K.O., Simon M.I., 
RA Miller J.H. ; 

RT "Genome sequence of the hyperthermophilic crenarchaeon Pyrobaculum 
RT aerophilum. 

RL Proc. Natl. Acad. Sci. U.S.A. 99:984-989(2002). 

CC CATALYTIC ACTIVITY: 2 , 3-dihydroxy-3-methylbutanoate = 3-methyl-2- 

CC oxobutanoate + H(2)0. 

COFACTOR: Binds 1 4Fe-4S cluster (Potential) . 
PATHWAY: Valine and isoleucine biosynthesis; fourth step. 
SIMILARITY: Belongs to the ilvD / edd family. 



CC 
CC 
CC 
CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license(3isb-sib . ch) . 

CC 

DR EMBL; AE009775; AAL62897.1; -. 

DR HAMAP; MF_00012; -; 1. 

DR InterPro; IPR004404; IlvD. 

DR InterPro; IPR000581; ILVD_EDD_f amily . 

DR Pfam; PF00920; ILVD_EDD; 1. 

DR ProDom; PD002691; ILVD_EDD_f amily ; 1. 

DR TIGRFAMs; TIGROOllO; ilvD; 1. 

DR PROSITE; PS00886; ILVD_EDD_1; 1. 

DR PROSITE; PS00887; ILVD_EDD_2; 1. 

KW Branched-chain amino acid biosynthesis; Lyase; Iron; Iron-sulfur; 

KW 4Fe-4S; Complete proteome. 

FT METAL 124 124 IRON-SULFUR (4FE-4S) (POTENTIAL) . 

FT METAL 196 196 IRON-SULFUR (4FE-4S) (POTENTIAL) . 

SQ SEQUENCE 562 AA; 59751 MW; 1D4DF358B14FC8 66 CRC64; 

Query Match 88.1%; Score 37; DB 1; Length 562; 

Best Local Similarity 100.0%; Pred. No. 1.2e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 358 QPPPLP 363 



RESULT 12 
EXTN_TOBAC 

ID EXTN_TOBAC STT^NDARD; PRT; 620 AA. 

AC P13983; 

DT Ol-JAN-1990 (Rel. 13, Created) 

DT Ol-JAN-1990 (Rel. 13, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE Extensin precursor (Cell wall hydroxyproline-rich glycoprotein) . 
GN HRGPNT3 . 

OS Nicotiana tabacum (Common tobacco) . 

OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta ; 

OC Spermatophyta; Magnoliophyta ; eudicotyledons ; core eudicots; asterids; 



OC lamiids; Solanales; Solanaceae; Nicotiana. 
OX NCBI_TaxlD=4097; 
RN [1] 

RP SEQUENCE FROM N.A. 
RC STRAIN=cv. Xanthi; TISSUE^Leaf; 
RX MEDLINE-90128263; PubMed=2 612 9 09 ; 
RA Keller B., Lamb C.J,; 

"Specific expression of a novel cell wall hydroxyproline-rich 
glycoprotein gene in lateral root initiation."; 
RL Genes Dev. 3:1639-164 6(1989). 

CO FUNCTION: Has a specialized structural function, possibly in 

the mechanical penetration of the cortex and epidermis of the 
CC main root. 

CC -!- SUBCELLULAR LOCATION: Extracellular matrix. 

CC -!- PTM: EXTENSINS CONTAIN A CHARACTERISTIC REPEAT OF THE PENTAPEPTIDE 
CC SER-PR0{4). THE PROLINE RESIDUE IS HYDROXYLATED AND THEN 

CC GLYCOSYLATED. 



RT 
RT 



CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license0isb-sib . ch ) . 

CC 

DR EMBL; X13885; CAA32090.1; -. 



DR 


PIR; S06733; S06733. 




DR 


InterPro 


; IPR0004 


80; Glutelin. 


DR 


PRINTS; 


PR00211; 


GLUTELIN 




KW 


Repeat; 


Cell wall 


; Glycoprotein; Signal; Structural protein; 


KW 


Hydroxylation . 






FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


CHAIN 


21 


62 0 


EXTENSIN. 


FT 


REPEAT 


70 


73 


H-A-P-P. 


FT 


REPEAT 


148 


151 


H-A-P-P. 


FT 


DOMAIN 


229 


242 


2 X 7 AA TANDEM REPEATS OF T-H-R- 


FT 


REPEAT 


229 


235 


1. 


FT 


REPEAT 


236 


242 


2. 


FT 


DOMAIN 


205 


62 0 


CONTAINS THE SER-PR0(4) REPEATS. 


FT 


DOMAIN 


499 


600 


3 X APPROXIMATE TANDEM REPEATS. 


SQ 


SEQUENCE 


620 AA 


; 65406 


MW; 641DD2278AB28524 CRC64 ; 



-P. 



Query Match 88.1%; Score 37; DB 1; Length 620; 

Best Local Similarity 100.0%; Pred. No. 1.3e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I M 

Db 425 QPPPLP 430 



RESULT 13 
DREB_HUMAN 

ID DREB_HUMAN STANDARD; PRT ; 649 AA. 

AC Q16643; 

DT Ol-NOV-1997 (Rel. 35, Created) 



DT Ol-NOV-1997 (Rel. 35, Last sequence update) 

DT lO-OCT-2003 (Rel. 42, Last annotation update) 

DE Drebrin (Developmentally regulated brain protein) . 

GN DBNl . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Osteoblast; 

RA Fisher L.W., McBride O.W., Filpula D., Ibaraki K. , Young M.F.; 

RT "Human drebrin: cDNA sequence, mRNA tissue distribution and 

RT chromosomal localization."; 

RL Neurosci. Res. Commun. 14:35-42(1994). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Fetal brain; 

RX MEDLINE=94030036; PubMed=8216329; 

RA Toda M. , Shirao T., Minoshima S., Shimizu N. , Toya S., Uyemura K. ; 

RT "Molecular cloning of cDNA encoding human drebrin E and chromosomal 

RT mapping of its gene."; 

RL Biochem. Biophys . Res. Commun. 196:468-472(1993). 

RN [3] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Eye, and Muscle; 

RX MEDLINE=22388257; PubMed-12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A., Rubin G.M. , Hong L., 

RA Stapleton M. , Scares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J.A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W. , Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2 002). 

CC -!- FUNCTION: Drebrins might play some role in cell migration, 

extension of neuronal processes and plasticity of dendrites, 
CC respectively. Binds F-actin. 

CC SUBCELLULAR LOCATION: Cytoplasmic. 

CC -!- TISSUE SPECIFICITY: Brain neurons. 7\lso found in the heart, 
CC placenta, skeletal muscle, kidney and pancreas. 

GC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 



cc use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U00802; AAA16256.1; 

DR EMBL; D17530; BAA04480.1; -. 

DR EMBL; BC000283; AAH00283.1; -. 

DR EMBL; BC007281; AAH07281.1; -. 

DR EMBL; BC007567; AAH07567.1; -. 

DR PIR; JN0809; JN0809. 

DR Genew; HGNC:2695; DBNl. 

DR MIM; 126660; 

DR GO; GO: 0015629; C:actin cytoskeleton; TAS . 

DR GO; GO: 0003779; F:actin binding; TAS. 

DR InterPro; IPR002108; Actbind^cof In . 

DR Pfam; PF00241; cofilin_ADF; 1. 

DR SMART; SM00102; ADF; 1. 

KW Actin-binding; Neurone. 

SQ SEQUENCE 649 AA; 71425 MW; A17 30E7C5FC32D8 8 CRC64; 

Query Match 88.1%; Score 37; DB 1; Length 649; 

Best Local Similarity 100.0%; Fred. No. 1.4e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 2 QPPPLP 7 

I I I I I I 

Db 363 QPPPLP 368 



RESULT 14 
AXN_DROME 

ID AXN_DROME STANDARD; PRT; 745 AA. 

AC Q9V407; Q9XYC1; 

DT 16-OCT-2001 (Rel. 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Axin (Axis inhibition protein) (dAxin) (d-Axin) . 

GN AXN OR CG7926. 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae ; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Embryo; 

RX MEDLINE=99174088; PubMed=1007 394 0 ; 

RA Hamada F. , Tomoyasu Y. , Takatsu Y., Nakamura M. , Nagai S.-I., 

RA Suzuki A., Fujita F. , Shibuya H., Toyoshima K. , Ueno N., Akiyama T. ; 

RT "Negative regulation of Wingless signaling by D-axin, a Drosophila 

RT homolog of axin.'*; 

RL Science 283:1739-1742(1999). 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Ruel L., Anthopoulos N., Goncalves J., Manoukian A. S . , Woodgett J.R.; 

RT "A Drosophila homolog of the axin gene is involved in the transduction 



RT of the wingless signal regulating the stability of the armadillo 
RT protein."; 

RL Submitted (SEP-1998) to the EMBL/GenBank/DDBJ databases. 
RN [3] 

RP SEQUENCE FROM N.A. 
RC STRAIN=Berkeley; 

RX MEDLINE-20196006; PubMed=107 3 1132 ; 

RA Adams Celniker S.E., Holt R.A. , Evans C.A., Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 

RA George R.A. , Lewis S.E., Richards S., Ashburner M. , Henderson S.N., 

RA Sutton G.G., Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champa M. , Pfeiffer B.D., 

RA Wan K.H., Doyle Baxter E.G., Helt G., Nelson C.R., Miklos G.L.G., 

RA Abril J.F,, Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D., Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A., Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M., Cawley S., Dahlke C, Davenport L.B., Davies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K. , Doup L.E., Downes M. , Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W. , 

RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K., 

RA Glodek A., Gong F., Gorrell J.H., Gu Z., Guan P., Harris M. , 

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J., 

RA Hostin D., Houston K.A. , Howland T.J., Wei M.-H., Ibegwam 

RA Jalali M. , Kalush F., Karpen G.H., Ke Z., Kennison J.A. , Ketchum K.A. , 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y., Levitsky A.A. , Li J.H., Li Z., Liang Y., Lin X., 

RA Liu X., Mattel B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina Mobarry C. , Morris J., Moshrefi A., 

RA Mount S.M., Moy M., Murphy B., Murphy L. , Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A. , Nixon K. , Nusskern D.R., Pacleb J.M., 

RA Palazzolo M. , Pittman G.S., Pan S., Pollard J., Purl V., Reese M.G., 

RA Reinert K. , Remington K. , Saunders R.D.C., Scheeler F., Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E., Spradling A.C., Stapleton M. , Strong R. , Sun E., 

RA Svirskas R., Tector C, Turner R. , Venter E. , Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G., Zhao Q., Zheng L. , 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A. , Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster . " ; 

RL Science 287:2185-2195(2000). 

RN [4] 

RP FUNCTION. 

RX MEDLINE-99387984; PubMed=10457 025 ; 

RA Willert K., Logan C.Y., Arora A., Fish M., Nusse R. ; 

RT "A Drosophila Axin homolog, Daxin, inhibits Wnt signaling."; 

RL Development 12 6:4165-4173(1999). 

CC FUNCTION: Inhibitor of the WG signaling pathway. Down-regulates 

CC beta-catenin (armadillo=ARM) . Probably facilitate the 

CC phosphorylation of beta-catenin and APC by GSK3B (zeste-white 

CC 3=ZW3) . 

CC -!- SUBUNIT: Interacts with ZW3 and ARM. The interaction between AXN 
^nd ARM occurs via the armadillo repeats contained in ARM. 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
FT 
FT 
FT 
FT 
FT 
SQ 



- ! ■ 



SUBCELLULAR LOCATION: Cytoplasmic (By similarity). 
DEVELOPMENTAL STAGE: Ubiquitously expressed throughout the 
development. 
SIMILARITY: Contains 
SIMILARITY: Contains 



1 RGS domain. 
1 DIX domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AF086811; AAD24886.1; 
EMBL; AF091813; AAF21293.1; 
EMBL; AE003772; 7^F56993.1; 
FlyBase; FBgn0026597; Axn , 

GO; 00:0007455; P : eye-antennal disc metamorphosis; IMP. 

GO; GO: 0016055; P:Wnt receptor signaling pathway; IGI. 

InterPro; IPR001158; DIX. 

InterPro; IPR000342; Regl_Gprotein . 

Pfam; PF00778; DIX; 1. 

ProDom; PD003639; DIX; 1. 

ProDom; PD001580; Regl_Gprotein; 1. 

SM7VRT; SM00315; RGS; 1. 

PROSITE; PS50841; DIX; 1. 

PROSITE; PS50132; RGS; 1. 

Developmental protein; Wnt signaling pathway. 
DOMAIN 
DOMAIN 
DOMAIN 
CONFLICT 
CONFLICT 
SEQUENCE 



54 


172 


RGS. 


640 


646 


POLY-SER. 


663 


745 


DIX. 


454 


454 


R -> Q (IN REF. 1) . 


644 


645 


MISSING (IN REF. 1) . 


745 AA; 


81718 


MW; 31A502528CEE84BA CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 745; 

Best Local Similarity 100.0%; Pred. No. 1.6e+02; 
Matches 6; Conservative 0; Mismatches 0; Indels 

Qy 2 QPPPLP 7 

I M I I I 

Db 655 QPPPLP 660 



0; Gaps 



0; 



RESULT 15 
DYN1_CAEEL 

ID DYN1_CAEEL STANDARD; PRT; 830 AA. 

AC P39055; Q93176; Q95QY9; 

DT Ol-FEB-1995 (Rel. 31, Created) 

DT 28-FEB-2003 (Rel. 41, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE Dynamin (EC 3.6.1.50). 

GN DYN-1 OR C02C6.1. 

OS Caenorhabditis elegans . 

OC Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; Rhabditoidea ; 
OC Rhabditidae; Peloderinae; Caenorhabditis. 



ox NCBI_TaxID=6239; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM A) . 

RC STRAIN-Bristol N2 ; 

RX MEDLINE=97439883; PubMed=92 94229 ; 

RA Clark S.G., Shurland D.L., Meyerowitz E.M., Bargmann C.I., 

RA van der Bliek A.M.; 

RT "A dynamin GTPase mutation causes a rapid and reversible temperature- 

RT inducible locomotion defect in C. elegans."; 

RL Proc. Natl. Acad. Sci. U.S.A. 94:10438-10443(1997). 

RN [2] 

RP REVISIONS TO C-TERMINUS . 

RA van der Bliek A.M. ; 

RL Submitted (JUL-1999) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORMS A AND B) . 

RC STRAIN=Bristol N2 ; 

RA Swinburne J. ; 

RL Submitted (AUG-1996) to the EMBL/GenBank/DDB J databases. 

CC FUNCTION: Microtubule-as sociated force-producing protein involved 

CC in producing microtubule bundles and able to bind and hydrolyze 

GTP. Most probably involved in vesicular trafficking processes, in 
CC particular endocytosis. 

CC CATALYTIC ACTIVITY: GTP + H(2)0 = GDP + phosphate. 

CC -!- SUBCELLULAR LOCATION: Microtubule-associated . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=a; 

CC IsoId=P39055-l; Sequence=Displayed; 

CC Name^b ; 

CC IsoId=P39055-2; Sequence-VSP_001329 ; 

CC Note=No experimental confirmation available; 

CC SIMILARITY: Belongs to the dynamin family. 

CC -!- SIMILARITY: Contains 1 PH domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license(3isb-sib . ch) . 

CC 

DR EMBL; L29031; AAB72228.2; -. 

DR EMBL; Z79596; CAB01857.1; 

DR EMBL; Z79596; CAC42251.1; 

DR PIR; T18860; T18860, 

DR HSSP; Q05193; 2DYN. 

DR WormPep; C02C6.1a; CE07833. 

DR WormPep; C02C6.1b; CE07832. 

DR InterPro; IPR001401; Dynamin. 

DR InterPro; IPR000375; Dynamin_central . 

DR InterPro; IPR003130; GED. 

DR InterPro; IPR001849; PH. 

DR Pfam; PF00350; dynamin; 1. 

DR Pfam; PF01031; dynamin_2; 1. 

DR Pfam; PF02212; GED; 1. 



DR 


Pfam; PF00169; PH; 


1. 






DR 


PRINTS; 


PRO 01 95; DYNAMIN. 






DR 


SMART; SM00053; DYNc; 1. 






DR 


SMART; SM0 03 02; GED ; 1. 






DR 


SMART; SM0 02 33; PH 


; 1. 






DR 


PROSITE; 


PS00410; 


DYNAMIN; 1. 




DR 


PROSITE; 


PS50003; 


PH DOMAIN; 


1. 


PCW 


Hydrolase; Motor protein; 


GTP-binding; Microtubule; Multigene family; 


KW 


Endocytosis ; Alternative 


splicing . 


FT 


NP_BIND 


40 


47 




GTP (BY SIMILARITY) . 


FT 


NP_BIND 


138 


142 




GTP (BY SIMILARITY) . 


FT 


NP_BIND 


207 


210 




GTP (BY SIMILARITY) . 


FT 


DOMAIN 


519 


624 




PH. 


FT 


VARSPLIC 


817 


830 




PGPGGPPPNMAPPR -> VPTPSNGAPEI PARPQVPKRPF 


FT 










(in isoform b) . 


FT 










/FTId=VSP_001329. 


FT 


CONFLICT 


734 


734 




R -> P (IN REF. 1) . 


SQ 


SEQUENCE 


830 AA; 


93407 


MW; 


FF681250E51AB8A5 CRC64; 



Query Match 88.1%; Score 37; DB 1; Length 830; 

Best Local Similarity 100.0%; Pred. No. 1.8e+02; 

Matches 6; Conservative 0; Mismatches 0; Indels 0; Gaps 

Qy 2 QPPPLP 7 

M I I I I 

Db 747 QPPPLP 752 



Search completed: August 24, 2004, 15:43:22 
Job time : 9.76119 sees 



